Skip to content

Commit

Permalink
Merge pull request #100 from m453h/chore-update-dependencies-cautions
Browse files Browse the repository at this point in the history
Add version constraint caution for Trafilatura
  • Loading branch information
m453h authored Oct 26, 2024
2 parents 1e542fd + bbbaa9d commit 90be167
Showing 1 changed file with 6 additions and 2 deletions.
8 changes: 6 additions & 2 deletions pyproject.toml
Original file line number Diff line number Diff line change
Expand Up @@ -29,8 +29,12 @@ dependencies = [
# for language detection
"py3langid==0.2.*",
# various content extractors we try to use
"newspaper3k==0.2.*", "goose3==3.1.*", "BeautifulSoup4==4.12.*", "readability-lxml==0.8.*",
"trafilatura==1.8.*", "boilerpy3==1.0.*",
"newspaper3k==0.2.*",
"goose3==3.1.*",
"BeautifulSoup4==4.12.*",
"readability-lxml==0.8.*",
"trafilatura==1.8.*", # must stay below v1.11.* to allow easy extraction of canonical_url
"boilerpy3==1.0.*",
# support
"requests", # leave un-versioned so dependencies can sort of which version is best
"faust-cchardet==2.1.*", # BeautifulSoup4 speedup
Expand Down

0 comments on commit 90be167

Please sign in to comment.