InfoExtractor License

Creative Commons License

InfoExtractor by Chirag Shah is licensed under a Creative Commons Attribution-Noncommercial-Share Alike 3.0 United States License.
Based on a work at www.infoextractor.org.


Enter a web address (URL)

InfoExtractor currently understands the following URLs:
  • YouTube video pages
  • YouTube user profile pages
  • Facebook profiles and pages
  • Wikipedia entries
  • Huffington Post posts
  • BlogCatalog blog posts
  • The Heritage Foundation blog (The Foundry)
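
As a rough illustration of how a URL could be matched against the list above, here is a minimal Python sketch. It is not InfoExtractor's actual code. The domain names, the YouTube path prefixes (`/watch`, `/user/`), and the names `HOST_CATEGORIES` and `classify_url` are all assumptions made for this example.

```python
from urllib.parse import urlparse

# Hypothetical domain-to-category table. These domains are assumptions
# for illustration, not taken from InfoExtractor itself.
HOST_CATEGORIES = {
    "facebook.com": "Facebook profile or page",
    "wikipedia.org": "Wikipedia entry",
    "huffingtonpost.com": "Huffington Post post",
    "blogcatalog.com": "BlogCatalog blog post",
    "heritage.org": "The Foundry (Heritage Foundation blog)",
}


def host_matches(host, domain):
    """True if host is the domain itself or one of its subdomains."""
    return host == domain or host.endswith("." + domain)


def classify_url(url):
    """Return a category label for a supported URL, or None otherwise."""
    parsed = urlparse(url)
    host = (parsed.hostname or "").lower()

    # YouTube needs a path check to tell video pages from user profiles
    # (assumed path prefixes).
    if host_matches(host, "youtube.com"):
        if parsed.path.startswith("/watch"):
            return "YouTube video page"
        if parsed.path.startswith("/user/"):
            return "YouTube user profile page"
        return None

    for domain, category in HOST_CATEGORIES.items():
        if host_matches(host, domain):
            return category
    return None
```

Under these assumptions, a URL whose host is not in the table yields `None`, which a caller could treat as "not understood".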

Upload a file to process

The file should be in plain text format.
Put one URL per line.
See an example file.
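
For illustration only, a file in this format could be read with a few lines of Python. The function name `read_url_list` and the sample URLs are hypothetical, and skipping blank lines and trimming surrounding whitespace are assumptions about how such a file might reasonably be handled, not documented InfoExtractor behavior.

```python
def read_url_list(text):
    """Return the URLs found in plain text that holds one URL per line.

    Assumption: blank lines are ignored and surrounding whitespace
    is trimmed from each line.
    """
    urls = []
    for line in text.splitlines():
        line = line.strip()
        if line:
            urls.append(line)
    return urls


# Hypothetical file contents, one URL per line.
sample = """http://www.youtube.com/watch?v=EXAMPLE
http://en.wikipedia.org/wiki/Information_extraction
"""

for url in read_url_list(sample):
    print(url)
```

To read from disk rather than from a string, the file contents could be passed in with `read_url_list(open(path, encoding="utf-8").read())`, where `path` is the location of the uploaded file.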

Check out our Facebook Harvester, soon to be integrated into InfoExtractor.
Also coming soon: The New York Times Crawler.

