Skip to content

Latest commit

 

History

History
65 lines (55 loc) · 1.52 KB

README-contents.md

File metadata and controls

65 lines (55 loc) · 1.52 KB

Contents

An inaccurate aspirational table of contents.

  • geo

    • countries
    • timezones
    • geonames_places
    • geonames_postal
    • iso_currencies
    • iso_languages
    • iso_langscripts
    • natural_earth
    • https://github.com/zmaril/Visualization-Data[GeoJSON world boundaries]
    • cia_factbook
    • geo/wikigrounder_toponyms (5 MB/23 MB) -- grounded place names with rough categories (county, province, etc) download
    • UFO sightings
  • stats

    • integers -- a table of integers; surprisingly useful for many database tasks and annoying to generate
  • airline flights

    • airline_flights
    • airline_airports
    • airline_airlines
    • airline_airfares
  • access logs

    • weblogs_waxydotorg
    • weblogs_worldcup
  • text

    • demo_texts -- a few short text passages for testing
    • gutenberg
    • twl/CSW(sowpods) -- lang/corpora/scrabble
    • BNC --
    • quackle -- misc/words_quackle
    • wordnet
    • dirty_words
    • nltk
    • stopwords
    • color_names
    • fakered_customer_data
    • lorem
  • sports

    • parks
    • teams
    • franchises
    • players
    • games
  • wikipedia

    • wikipedia_articles -- article text
    • wikipedia_pageinfos -- article metadata
    • wikipedia_pagelinks -- pagelinks
    • wikipedia_pageviews -- pageview counts by hour
    • (geolocated)
    • (geoimplied)
    • (dbpedia)
  • weather

    • weather_hourly -- hourly global
    • weather_stations -- weather stations