{"id":581,"date":"2010-07-28T06:43:24","date_gmt":"2010-07-28T13:43:24","guid":{"rendered":"http:\/\/blog.caida.org\/best_available_data\/?p=581"},"modified":"2012-04-10T16:18:30","modified_gmt":"2012-04-10T23:18:30","slug":"ip-as-mappings","status":"publish","type":"post","link":"https:\/\/blog.caida.org\/best_available_data\/2010\/07\/28\/ip-as-mappings\/","title":{"rendered":"IP-AS mappings"},"content":{"rendered":"<p>We have performed an <a href=\"http:\/\/www.caida.org\/research\/routing\/prefix_as_comparison\">analysis of the IP-AS mapping obtained from Routeviews\/RIPE collectors<\/a>. <\/p>\n<p>A crucial step in various research efforts that study the Internet infrastructure is to map an IP address to the Autonomous System (AS) to which it is assigned. The most common approach to map IP addresses to ASes is by using BGP table dumps from public repositories such as <a href=\"http:\/\/www.routeviews.org\">Routeviews<\/a> and <a href=\"http:\/\/www.ripe.net\/projects\/ris\/rawdata.html\">RIPE<\/a>. We assign &#8220;ownership&#8221; of an IP address to the AS that originates the longest BGP prefix that matches the IP address. Routeviews and RIPE, however, have multiple collectors, each of which peers with a diverse set of ASes. Consequently, the IP-AS mapping obtained by using the BGP table dump from one collector could be different from that obtained from a different collector. The obvious solution is to aggregate views from as many vantage points as possible to obtain the most complete IP-AS mapping possible. In practice, however, it is common to use data from just one or two collectors, as it greatly simplifies the process of collecting and pre-processing data. The goal of our analysis is to compare different collectors, in terms of the different metrics that we are interested in, viz. address space coverage, IP-AS mapping, unique ASes, unique prefixes, unique more specific prefixes, AS links, and AS paths. Further, we study the utility of adding data from more collectors, in terms of the resulting change in the aforementioned metrics. Finally, we compare the IP-AS mapping from Routeviews and RIPE tables with that obtained from <a href=\"http:\/\/www.team-cymru.org\/Services\/ip-to-asn.html\">Team Cymru&#8217;s whois service<\/a>.<\/p>\n<p><!--more--><\/p>\n<p><img loading=\"lazy\" alt=\"\" src=\"http:\/\/www.caida.org\/research\/routing\/prefix_as_comparison\/images\/all_addr_ch.png\" class=\"alignnone\" width=\"600\" height=\"420\" \/><\/p>\n<p>The figures above show the relative changes in address space coverage when we start with table dumps from Routeviews&#8217; LINX collector (which we choose as the base table, as it provides the largest coverage of IPv4 address space of any single table in Routeviews), and successively add data from collectors in decreasing order of address space coverage. The first plot above shows that data from additional collectors results in less than 1% increase in address space coverage, and the second plot shows that additional collectors incur a change in the IP-AS mapping for fewer than 1% of addresses represented in the tables. <\/p>\n<p><img loading=\"lazy\" alt=\"\" src=\"http:\/\/www.caida.org\/research\/routing\/prefix_as_comparison\/images\/cur_unq_lnk.png\" class=\"alignnone\" width=\"600\" height=\"420\" \/><\/p>\n<p>We repeated the same analysis for other metrics of interest &#8211; unique prefixes, unique more-specific prefixes, unique ASes, unique origin ASes, and unique AS links. We find that additional table dumps yield fewer than 1% additional (previously unseen) ASes and origin ASes, confirming previous reports that most ASes are observable from even a few vantage points. However, for other metrics, additional table dumps matter: adding a table dump can yield up to 4.8% more AS links (shown in the above figure), up to 4.6% more prefixes and 4.7% more specific prefixes than seen in the base table. Furthermore, between 10% and 70% of the more specific prefixes seen in additional table dumps are originated by a different origin AS than in the base table. <\/p>\n<p>We also compared the IP-AS mapping obtained from Routeviews\/RIPE table dumps with that obtained from <a href=\"http:\/\/www.team-cymru.org\/Services\/ip-to-asn.html\">Team Cymru&#8217;s whois service<\/a>. We find that  the difference between the IP-AS mapping from Cymru and that obtained by combining data from all Routeviews and RIPE collectors is small (0.7% of queried addresses returned a different origin AS). 56% of these IP-AS mismatches are due to cases where Cymru and the table dumps return a single, but different AS. A significant number (41%) of mismatches are due to Multi-Origin ASes (MOASes). In particular, 34% of IP-AS mismatches are due to MOASes where the Cymru mapping does not contain one of the ASes returned by the table dumps. <\/p>\n<p>In summary, our findings are reassuring. In terms of IP-AS mapping, using data from just a few of the largest Routeviews\/RIPE collectors is sufficient; adding data from more collectors does not significantly change the IP-AS mapping or coverage of IPv4 address space. Also, using data from Routeviews\/RIPE is not significantly different from using Team Cymru&#8217;s whois service. In fact, in the data we compared, the combination of table dumps from all Routeviews\/RIPE collectors gave a better view of MOAS prefixes than Cymru&#8217;s lookup service. <\/p>\n","protected":false},"excerpt":{"rendered":"<p>We have performed an analysis of the IP-AS mapping obtained from Routeviews\/RIPE collectors. A crucial step in various research efforts that study the Internet infrastructure is to map an IP address to the Autonomous System (AS) to which it is assigned. The most common approach to map IP addresses to ASes is by using BGP [&hellip;]<\/p>\n","protected":false},"author":10,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":[],"categories":[1,10,11,8],"tags":[],"coauthors":[],"_links":{"self":[{"href":"https:\/\/blog.caida.org\/best_available_data\/wp-json\/wp\/v2\/posts\/581"}],"collection":[{"href":"https:\/\/blog.caida.org\/best_available_data\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blog.caida.org\/best_available_data\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blog.caida.org\/best_available_data\/wp-json\/wp\/v2\/users\/10"}],"replies":[{"embeddable":true,"href":"https:\/\/blog.caida.org\/best_available_data\/wp-json\/wp\/v2\/comments?post=581"}],"version-history":[{"count":57,"href":"https:\/\/blog.caida.org\/best_available_data\/wp-json\/wp\/v2\/posts\/581\/revisions"}],"predecessor-version":[{"id":1812,"href":"https:\/\/blog.caida.org\/best_available_data\/wp-json\/wp\/v2\/posts\/581\/revisions\/1812"}],"wp:attachment":[{"href":"https:\/\/blog.caida.org\/best_available_data\/wp-json\/wp\/v2\/media?parent=581"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blog.caida.org\/best_available_data\/wp-json\/wp\/v2\/categories?post=581"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blog.caida.org\/best_available_data\/wp-json\/wp\/v2\/tags?post=581"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/blog.caida.org\/best_available_data\/wp-json\/wp\/v2\/coauthors?post=581"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}