Python tools for use with the Basis Tech Rosette API entity extraction, categorization, sentiment analysis etc
sidprobstein/rosette_api
===========
rosette-api
===========

Python tools for use with the Basis Technology Rosette API

* enrich.py: Enrich JSON files
* trytext.py: Template for storing/analyzing responses from Rosette APIs

enrich.py
----------

::

    python enrich.py [-h] [-o OUTPUTDIR] [-r RESPONSEDIR] -k KEY filespec

Arguments
---------

-o OUTPUTDIR    specifies the subdirectory where enriched files are written; default: 'enriched'
-r RESPONSEDIR  specifies the subdirectory where responses from Rosette API are saved; default: 'responses'
-f              specifies one or more text field(s) in the JSON to pass to Rosette API
-k              specifies the Rosette API key

filespec must be the path to one or more JSON files

Example
-------

::

    python enrich.py -k 12345 -f subject -f body *.json

Operation
---------

Enrich.py iterates through each input file, combines the specified fields and sends them to Rosette API for entity extraction. (The fields will be specified on the CLI in a future version.) The response from Rosette is written to a file with the same name as the input file, located in the response subdirectory.

Enrich.py then filters the entities to those with confidence > 0.1 and groups them into lists by type. Finally, it merges those lists with the input file and writes the result into the enriched subdirectory.

Enrich.py will not overwrite a saved response or a file in the enriched subdirectory. This is intended to allow modification of the filtering and mapping logic (starting with '# filter/map entities' in the module) without calling Rosette API live each time. After making modifications, delete the enriched file to make enrich.py create a new version from the saved response.

To obtain an updated response from Rosette API, delete the response file. Note that this only makes sense if your input file has changed, or Rosette has.
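
The filter, group and merge steps described under Operation can be sketched roughly as follows. This is a minimal illustration, not the code in enrich.py: the function names ``group_entities`` and ``enrich`` are hypothetical, and the response shape (an ``entities`` list whose items carry ``type``, ``mention`` and ``confidence`` keys) is an assumption about what Rosette returns. An entity with no ``confidence`` value is treated as 0.0 here and dropped, which is also an assumption.

```python
from collections import defaultdict

MIN_CONFIDENCE = 0.1  # threshold named in the Operation section


def group_entities(response, min_confidence=MIN_CONFIDENCE):
    """Keep entities above the threshold and group their mentions by type.

    Assumes `response` looks like {"entities": [{"type": ..., "mention": ...,
    "confidence": ...}, ...]}; the real response may differ.
    """
    grouped = defaultdict(list)
    for entity in response.get("entities", []):
        # Strictly greater than the threshold, matching "confidence > 0.1".
        if entity.get("confidence", 0.0) > min_confidence:
            grouped[entity["type"]].append(entity["mention"])
    return dict(grouped)


def enrich(document, response):
    """Return a copy of the input document with one list per entity type added."""
    merged = dict(document)  # leave the caller's dict untouched
    merged.update(group_entities(response))
    return merged
```

With this sketch, an input document ``{"subject": "hi"}`` and a response containing one PERSON entity above the threshold would yield a document with an added ``PERSON`` key holding a list of mentions. Note that ``update`` would overwrite an input field that happens to share a name with an entity type.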

Notes
-----

* Only Entity Extraction is currently supported; use trytext.py to explore the others

trytext.py
----------

::

    python trytext.py -k KEY

Arguments
---------

-k  specifies the Rosette API key

Notes
-----

* Review the code to see how to change the sample text and analyze the response
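
For enrich.py, the rule that a saved response is never overwritten (see Operation) amounts to a simple check-before-call pattern. The sketch below is illustrative only: ``load_or_fetch`` is a hypothetical helper, and ``fetch`` stands in for whatever function actually calls Rosette API, so no real API call is shown.

```python
import json
import os


def load_or_fetch(input_path, response_dir, fetch):
    """Return the saved response for input_path, calling fetch() only if none exists.

    `fetch` is a placeholder callable that takes the input path and returns
    a JSON-serializable response.
    """
    # Saved responses use the same file name as the input, in response_dir.
    response_path = os.path.join(response_dir, os.path.basename(input_path))
    if os.path.exists(response_path):
        with open(response_path, encoding="utf-8") as f:
            return json.load(f)

    response = fetch(input_path)
    os.makedirs(response_dir, exist_ok=True)
    with open(response_path, "w", encoding="utf-8") as f:
        json.dump(response, f)
    return response
```

Deleting the saved file makes the next call fall through to ``fetch`` again, which mirrors the "delete the response file" instruction above.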