New CAIDA Prefix-to-AS Mapping Data Set
Monday, November 14th, 2022 by Bradley HuffakerSince May 9th, 2005, CAIDA has produced a data set that maps IPv4 prefixes (and later also IPv6 prefixes) to the AS (Autonomous System) originating that prefix into the global BGP routing system, as observed via a single BGP data collector of the Route Views data collection system. We have called this data set “RouteViews Prefix to AS”. We used CAIDA’s straighten_rv script to filter the RIB (routing information base file used as input data. We will discontinue this data set on December 31st, 2022 an replace it with a new more complete data set that we call CAIDA’s Prefix-to-AS data set.
CAIDA will use the BGPStream software package (and in particular the bgpview library) to include data from all available BGP collectors from both of the primary global publicly available collection systems: Route Views and RIPE NCC Routing Information Service. We will backfill Prefix-to-AS data to 2000. As part of this transition, CAIDA will no longer use straighten_rv to preprocess AS paths. We will create two files: an annotated file with all the data observed in BGP, and a simple file that filters out data of no interest to many researchers as described below.
Annotated files. The annotated file will include information about the stability and visibility of prefixes by different peers and collectors. Individuals who wish to produce a more refined mapping can fairly easily filter this data. The table below compares the older “Routeviews2” (a single Route Views collector) and the new annotated CAIDA Prefix-to-AS dataset (all collectors from both RIPE RIS and Route Views) for 1 June 2022. Most (99.6%) ASes and (87.2%) prefixes appeared in both datasets. Note that multiple ASNs announced the prefix 0.0.0.0/0, we exclude it since it covers the entire IPv4 address space.
ASN | filtered | Routeviews2 only | Routeviews+RIPE | both | total | ||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Multiorigin/set | 128 | 4.10% | 1552 | 49.73% | 1441 | 46.17% | 3121 | ||||||
public | 0 | 0.00% | 295 | 0.40% | 73294 | 99.60% | 73589 | ||||||
reserved | X | 0 | 0.00% | 1379 | 88.97% | 171 | 11.03% | 1550 | |||||
Prefix | filtered | Routeviews2 only | Routeviews+RIPE | both | total | ||||||||
larger then /8 | X | 0 | 0.00% | 1 | 100.00% | 0 | 0.00% | 1 | |||||
private | X | 0 | 0.00% | 504 | 84.85% | 90 | 15.15% | 594 | |||||
public | 0 | 0.00% | 138498 | 12.81% | 942469 | 87.19% | 1080967 |
Simple files. The simple file will exclude very large prefixes, e.g., with mask lengths < 8, private addresses (RFC 1918), or prefixes announced exclusively by reserved ASNs (Special-Purpose ASN). The resulting simple prefix-to-ASN mapping covers 99.7% of the address space captured by the annotated file. In the table below (also reflecting 1 June 2022), 0.94% of prefixes and 0.42% of addresses had an additional origin AS that was not also observed in the Routeviews2-only dataset. This reflects the expanded visibility of more collectors and peer. 4.92% of CAIDA’s prefixes and 1.82% of addresses were not covered by Routeviews2-only prefix2as. Overall the combined data set provides visibility of 5.86% of prefixes and 2.24% of addresses not covered by routeviews2-only.
CAIDA’s Prefix to AS “simple” (99.7% of addresses observed in annotated files)
ASN type | prefixes | addressses | ||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
source | agreement | Routeviews2 only |
Routeviews + RIPE |
number | group % | all % | number | group % | all % | |||||||||
both | different | multiorigin | multiorigin | 626 | 11.43% | 0.11% | 1241088 | 9.65% | 0.04% | |||||||||
public | multiorigin | 4816 | 87.95% | 0.82% | 11442617 | 88.93% | 0.37% | |||||||||||
set | multiorigin | 34 | 0.62% | 0.01% | 183039 | 1.42% | 0.01% | |||||||||||
5476 | 100.00% | 0.94% | 12866744 | 100.00% | 0.42% | |||||||||||||
both | same | multiorigin | multiorigin | 9869 | 1.79% | 1.69% | 12609229 | 0.42% | 0.41% | |||||||||
public | public | 540032 | 98.20% | 92.45% | 2988739528 | 99.58% | 97.35% | |||||||||||
set | set | 8 | 0.00% | 0.00% | 9216 | 0.00% | 0.00% | |||||||||||
549909 | 100.00% | 94.14% | 3001357973 | 100.00% | 97.76% | |||||||||||||
Routeviews+RIPE | N/A | multiorigin | 1884 | 6.55% | 0.32% | 908601 | 1.63% | 0.03% | ||||||||||
public | 26856 | 93.44% | 4.60% | 54919321 | 98.37% | 1.79% | ||||||||||||
set | 2 | 0.01% | 0.00% | 2816 | 0.01% | 0.00% | ||||||||||||
28742 | 100.00% | 4.92% | 55830738 | 100.00% | 1.82% |
You can find the new CAIDA Prefix-to-AS Mapping Data Set here.