{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "# Use Case Tutorial 1: Well-Connected US Regions\n", "\n", "This tutorial shows how to find the U.S. regions that are best connected by air travel.\n", "\n", "The U.S. Bureau of Transportation Statistics provides data on monthly air travel from all certificated U.S. air carriers and makes it available [here](https://www.transtats.bts.gov/DL_SelectFields.asp?Table_ID=258). The 2018 air travel data used for this tutorial can be downloaded [here](https://transtats.bts.gov/ftproot/TranStatsData/403537556_T_T100D_MARKET_US_CARRIER_ONLY.zip). We chose 2018 data to avoid any impact COVID-19 might have had on travel.\n", "\n", "We will use this data to determine which areas of the U.S. are best connected, as measured by betweenness centrality." ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "# Data Preprocessing\n", "\n", "Before looking at the data, we need to import some libraries." ] }, { "cell_type": "code", "execution_count": 1, "metadata": {}, "outputs": [], "source": [ "import metagraph as mg\n", "import pandas as pd" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Now let’s see what the data looks like." ] }, { "cell_type": "code", "execution_count": 2, "metadata": { "scrolled": true }, "outputs": [ { "data": { "text/html": [ "
| | PASSENGERS | FREIGHT | MAIL | DISTANCE | UNIQUE_CARRIER | AIRLINE_ID | UNIQUE_CARRIER_NAME | ORIGIN_AIRPORT_ID | ORIGIN_AIRPORT_SEQ_ID | ORIGIN_CITY_MARKET_ID | ... | DEST_AIRPORT_SEQ_ID | DEST_CITY_MARKET_ID | DEST | DEST_CITY_NAME | DEST_STATE_ABR | DEST_STATE_FIPS | DEST_STATE_NM | DEST_WAC | MONTH | Unnamed: 26 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0.0 | 410.0 | 0.0 | 616.0 | WN | 19393.0 | Southwest Airlines Co. | 13851 | 1385103 | 33851 | ... | 1069302 | 30693 | BNA | Nashville, TN | TN | 47 | Tennessee | 54 | 6 | NaN |
| 1 | 0.0 | 184.0 | 0.0 | 2592.0 | WN | 19393.0 | Southwest Airlines Co. | 14307 | 1430705 | 30721 | ... | 1289208 | 32575 | LAX | Los Angeles, CA | CA | 6 | California | 91 | 6 | NaN |
| 2 | 0.0 | 87.0 | 0.0 | 2445.0 | WN | 19393.0 | Southwest Airlines Co. | 14679 | 1467903 | 33570 | ... | 1025702 | 30257 | ALB | Albany, NY | NY | 36 | New York | 22 | 6 | NaN |
| 3 | 0.0 | 10.0 | 0.0 | 432.0 | WN | 19393.0 | Southwest Airlines Co. | 14730 | 1473003 | 33044 | ... | 1299206 | 32600 | LIT | Little Rock, AR | AR | 5 | Arkansas | 71 | 6 | NaN |
| 4 | 0.0 | 100.0 | 0.0 | 129.0 | WN | 19393.0 | Southwest Airlines Co. | 14747 | 1474703 | 30559 | ... | 1405702 | 34057 | PDX | Portland, OR | OR | 41 | Oregon | 92 | 6 | NaN |

5 rows × 27 columns
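The all-`NaN` `Unnamed: 26` column in the preview is what pandas typically produces when every line of a CSV, header included, ends with a trailing comma. Whether the BTS file is laid out that way is an assumption here. The following minimal sketch reproduces the effect on a tiny in-memory CSV with made-up rows and then drops the empty column; the names `raw` and `cleaned` are illustrative only.

```python
import io

import pandas as pd

# Hypothetical CSV whose lines all end with a trailing comma (not real BTS data).
csv_text = (
    "PASSENGERS,ORIGIN,DEST,\n"
    "10.0,JNU,SIT,\n"
    "4.0,DFW,DEN,\n"
)

# pandas names the empty fourth header field "Unnamed: 3" and fills it with NaN.
raw = pd.read_csv(io.StringIO(csv_text))

# Drop every column that contains only missing values.
cleaned = raw.dropna(axis="columns", how="all")

print(list(raw.columns))
print(list(cleaned.columns))
```

Dropping columns that are entirely empty avoids hard-coding the name `Unnamed: 26`, which would change if the number of columns in the download changed.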
| | PASSENGERS | ORIGIN_AIRPORT_ID | ORIGIN_AIRPORT_SEQ_ID | ORIGIN_CITY_MARKET_ID | ORIGIN | ORIGIN_CITY_NAME | ORIGIN_STATE_ABR | ORIGIN_STATE_NM | DEST_AIRPORT_ID | DEST_AIRPORT_SEQ_ID | DEST_CITY_MARKET_ID | DEST | DEST_CITY_NAME | DEST_STATE_ABR | DEST_STATE_NM |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 44447 | 1.0 | 12523 | 1252306 | 32523 | JNU | Juneau, AK | AK | Alaska | 11545 | 1154501 | 31545 | ELV | Elfin Cove, AK | AK | Alaska |
| 44448 | 1.0 | 12523 | 1252306 | 32523 | JNU | Juneau, AK | AK | Alaska | 11619 | 1161902 | 31619 | EXI | Excursion Inlet, AK | AK | Alaska |
| 44449 | 1.0 | 12610 | 1261001 | 32610 | KAE | Kake, AK | AK | Alaska | 10204 | 1020401 | 30204 | AGN | Angoon, AK | AK | Alaska |
| 44450 | 1.0 | 11298 | 1129806 | 30194 | DFW | Dallas/Fort Worth, TX | TX | Texas | 11292 | 1129202 | 30325 | DEN | Denver, CO | CO | Colorado |
| 44451 | 1.0 | 15991 | 1599102 | 35991 | YAK | Yakutat, AK | AK | Alaska | 14828 | 1482805 | 34828 | SIT | Sitka, AK | AK | Alaska |
| | ORIGIN_CITY_MARKET_ID | DEST_CITY_MARKET_ID | PASSENGERS | INVERSE_PASSENGER_COUNT |
|---|---|---|---|---|
| 0 | 30005 | 30349 | 4.0 | 0.250000 |
| 1 | 30005 | 31214 | 10.0 | 0.100000 |
| 2 | 30005 | 31517 | 193.0 | 0.005181 |
| 3 | 30005 | 35731 | 7.0 | 0.142857 |
| 4 | 30006 | 30056 | 5.0 | 0.200000 |
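In the table above, `INVERSE_PASSENGER_COUNT` equals `1 / PASSENGERS` (for example, `1 / 4.0 = 0.25`). One way such a table could be produced is sketched below. It assumes passengers are summed over all rows that share an origin and destination city market and that pairs with no passengers are dropped before inverting. The rows are made up and the name `pair_totals` is illustrative, so this is not necessarily the notebook's own code.

```python
import pandas as pd

# Hypothetical per-flight rows; two of them share the same market pair.
flights = pd.DataFrame(
    {
        "ORIGIN_CITY_MARKET_ID": [30005, 30005, 30005, 30006],
        "DEST_CITY_MARKET_ID": [30349, 30349, 31214, 30056],
        "PASSENGERS": [1.0, 3.0, 10.0, 0.0],
    }
)

# Total passengers for each (origin market, destination market) pair.
pair_totals = flights.groupby(
    ["ORIGIN_CITY_MARKET_ID", "DEST_CITY_MARKET_ID"], as_index=False
)["PASSENGERS"].sum()

# Pairs with zero passengers cannot be inverted, so remove them first.
pair_totals = pair_totals[pair_totals["PASSENGERS"] > 0].reset_index(drop=True)

# Busier routes get a smaller value, which suits a shortest-path edge weight.
pair_totals["INVERSE_PASSENGER_COUNT"] = 1.0 / pair_totals["PASSENGERS"]

print(pair_totals)
```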
| CITY_MARKET_ID | AIRPORT | CITY_NAME |
|---|---|---|
| 30005 | {05A} | {Little Squaw, AK} |
| 30006 | {06A} | {Kizhuyak, AK} |
| 30007 | {KLW} | {Klawock, AK} |
| 30009 | {HOM, 09A} | {Homer, AK} |
| 30010 | {1B1} | {Hudson, NY} |
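The table above maps each `CITY_MARKET_ID` to the set of airport codes and city names that belong to it. A lookup of this shape could be built by stacking the origin and destination columns and collecting the values per market into sets. The sketch below does that on a few made-up rows; `market_info` and the helper column list are illustrative names, and the notebook may have used a different approach.

```python
import pandas as pd

# Hypothetical rows using the column names from the flight data.
flights = pd.DataFrame(
    {
        "ORIGIN_CITY_MARKET_ID": [30009, 30009, 30007],
        "ORIGIN": ["HOM", "09A", "KLW"],
        "ORIGIN_CITY_NAME": ["Homer, AK", "Homer, AK", "Klawock, AK"],
        "DEST_CITY_MARKET_ID": [30007, 30007, 30009],
        "DEST": ["KLW", "KLW", "HOM"],
        "DEST_CITY_NAME": ["Klawock, AK", "Klawock, AK", "Homer, AK"],
    }
)

common_columns = ["CITY_MARKET_ID", "AIRPORT", "CITY_NAME"]

# Rename the origin and destination columns to a shared schema.
origins = flights[["ORIGIN_CITY_MARKET_ID", "ORIGIN", "ORIGIN_CITY_NAME"]].copy()
origins.columns = common_columns
dests = flights[["DEST_CITY_MARKET_ID", "DEST", "DEST_CITY_NAME"]].copy()
dests.columns = common_columns

# Collect every airport code and city name seen for each market into a set.
market_info = (
    pd.concat([origins, dests])
    .groupby("CITY_MARKET_ID")
    .agg(lambda values: set(values))
)

print(market_info)
```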
| CITY_MARKET_ID | BETWEENNESS_CENTRALITY_SCORE | AIRPORT | CITY_NAME |
|---|---|---|---|
| 32575 | 494817.0 | {LAX, SMO, SNA, HHR, LGB, BUR, ONT, VNY} | {Santa Ana, CA, Los Angeles, CA, Van Nuys, CA,... |
| 30559 | 465677.0 | {BFI, SEA, LKE, KEH} | {Kenmore, WA, Seattle, WA} |
| 30299 | 349232.0 | {ANC, DQL, MRI} | {Anchorage, AK} |
| 31703 | 337885.0 | {LGA, ISP, EWR, JRB, HPN, JRA, JFK, TSS, SWF} | {Islip, NY, New York, NY, Newark, NJ, Newburgh... |
| 30977 | 206250.0 | {LOT, GYY, ORD, PWK, DPA, MDW} | {Chicago/Romeoville, IL, Chicago, IL, Gary, IN} |
| 30397 | 144922.0 | {FTY, ATL, PDK, QMA} | {Kennesaw, GA, Atlanta, GA} |
| 30194 | 121807.0 | {RBD, ADS, FWH, FTW, AFW, DAL, DFW} | {Dallas/Fort Worth, TX, Dallas, TX, Fort Worth... |
| 30325 | 107586.0 | {APA, DEN} | {Denver, CO} |
| 31517 | 90409.0 | {FBK, EIL, MTX, A01, FAI} | {Fairbanks/Ft. Wainwright, AK, Fairbanks, AK} |
| 30113 | 75327.0 | {BET} | {Bethel, AK} |
| 30070 | 62402.0 | {KDK, ADQ} | {Kodiak, AK} |
| 30154 | 56833.0 | {ACK} | {Nantucket, MA} |
| 32467 | 48068.0 | {FXE, FLL, OPF, TMB, MIA, MPB} | {Miami, FL, Fort Lauderdale, FL} |
| 32457 | 46094.0 | {OAK, CCR, SFO, SJC} | {San Jose, CA, San Francisco, CA, Oakland, CA,... |
| 30466 | 45699.0 | {AZA, AZ3, PHX, GYR, SCF} | {Goodyear, AZ, Phoenix, AZ, Glendale, AZ} |
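To make the ranking above more concrete, here is a small sketch of weighted betweenness centrality on a toy directed graph. It uses NetworkX as a stand-in rather than metagraph, so it does not show the calls this tutorial itself makes. It also assumes that the inverse passenger count serves as the edge weight and that the scores are left unnormalized, which the large raw values in the table suggest but do not prove. The market labels and passenger counts are made up.

```python
import networkx as nx

# Hypothetical (origin market, destination market, passengers) triples.
routes = [
    ("A", "B", 100.0),
    ("B", "A", 100.0),
    ("B", "C", 100.0),
    ("C", "B", 100.0),
    ("A", "C", 1.0),  # a rarely used direct route
]

graph = nx.DiGraph()
for origin, dest, passengers in routes:
    # Assumed weighting: busier routes are "shorter" for shortest-path purposes.
    graph.add_edge(origin, dest, weight=1.0 / passengers)

# Count how many weighted shortest paths pass through each market.
scores = nx.betweenness_centrality(graph, weight="weight", normalized=False)

# Rank markets from most to least central.
ranked = sorted(scores.items(), key=lambda item: item[1], reverse=True)
print(ranked)
```

In this toy graph the busy two-leg route from A to C through B (total weight 0.02) is shorter than the rarely used direct route (weight 1.0), so B lies on the shortest paths in both directions between A and C and receives the highest score.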