ethangardner/py-keyword-extraction

Description

Reads a text file containing one URL per line, scrapes the content of each web page, and extracts key terms using natural language processing. Built with Python.
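As a rough illustration of the first step, here is a minimal sketch of reading the URL list. The function name `read_urls` is hypothetical, and skipping blank lines is an assumption; the script itself may handle the input file differently.

```python
from pathlib import Path


def read_urls(path):
    """Return the URLs listed in a text file, one per line.

    Hypothetical helper: surrounding whitespace is stripped and
    blank lines are skipped (an assumption, not documented behavior).
    """
    urls = []
    for line in Path(path).read_text(encoding="utf-8").splitlines():
        line = line.strip()
        if line:  # ignore empty lines
            urls.append(line)
    return urls
```

Each returned URL would then be fetched and passed on to the keyword extraction step.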

Requirements

Instructions

Run the script from the command line. A few options are required.

Required Arguments

  • -i, --input the name of the .txt file containing the URLs
  • -c, --content the selector for the content region to parse
  • -o, --output the name of the output file. Accepted formats are CSV and JSON.
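One way the output format could be chosen is from the file extension passed to `--output`. The sketch below assumes results are held as a mapping from URL to a list of keywords and that the CSV has one `url, keyword` row per term. Both the data shape and the function name `write_results` are illustrative assumptions, not the script's documented layout.

```python
import csv
import json
from pathlib import Path


def write_results(results, output_path):
    """Write {url: [keywords]} to a CSV or JSON file, chosen by extension.

    Hypothetical helper; the column layout is an assumption.
    """
    suffix = Path(output_path).suffix.lower()
    if suffix == ".json":
        with open(output_path, "w", encoding="utf-8") as f:
            json.dump(results, f, indent=2)
    elif suffix == ".csv":
        # newline="" lets the csv module control line endings
        with open(output_path, "w", encoding="utf-8", newline="") as f:
            writer = csv.writer(f)
            writer.writerow(["url", "keyword"])
            for url, keywords in results.items():
                for keyword in keywords:
                    writer.writerow([url, keyword])
    else:
        raise ValueError(
            f"Unsupported output format: {suffix!r} (expected .csv or .json)"
        )
```

Rejecting any other extension up front avoids scraping every page only to fail at the final write.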

Optional Arguments

  • -l, --length the minimum length of each keyword returned by the script
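The options above map naturally onto Python's standard `argparse` module. The following is a minimal sketch rather than the script's actual parser: the integer type and default of `1` for `--length`, the reading of "length" as a character count, and the helper names `build_parser` and `filter_by_length` are all assumptions.

```python
import argparse


def build_parser():
    """Build a parser for the documented options (illustrative sketch)."""
    parser = argparse.ArgumentParser(
        description="Extract key terms from a list of web pages."
    )
    parser.add_argument("-i", "--input", required=True,
                        help="text file containing one URL per line")
    parser.add_argument("-c", "--content", required=True,
                        help="selector for the content region to parse")
    parser.add_argument("-o", "--output", required=True,
                        help="output file name (csv or json)")
    # Assumption: length is an integer; the default of 1 is hypothetical.
    parser.add_argument("-l", "--length", type=int, default=1,
                        help="minimum length of each keyword")
    return parser


def filter_by_length(keywords, min_length):
    """Keep keywords with at least min_length characters (assumed meaning)."""
    return [k for k in keywords if len(k) >= min_length]
```

For example, `build_parser().parse_args(["-i", "urls.txt", "-c", "#content", "-o", "out.json", "-l", "4"])` yields a namespace whose `length` attribute is the integer `4`. The file names and the `#content` selector here are placeholders.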

About

Extracts keywords from a webpage using the Topia library
