You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
However, frequently the same article appears in more than one of these feeds.
For example, here is an identical record that appears in both technology and business right now (every field is identical, including the URL):
<item>
<title>Google lifts ban on using AI for weapons</title>
<description>
The tech giant has updated the principles governing its development of artificial intelligence.
</description>
<link>https://www.bbc.com/news/articles/cy081nqx2zjo</link>
<guid isPermaLink="false">https://www.bbc.com/news/articles/cy081nqx2zjo#0</guid>
<pubDate>Wed, 05 Feb 2025 00:43:40 GMT</pubDate>
<media:thumbnail width="240" height="135" url="https://ichef.bbci.co.uk/ace/standard/240/cpsprodpb/9489/live/6de4ebf0-e351-11ef-a08f-756c6bc158bd.jpg"/>
</item>
My understanding is that Nextcloud News generates a GUID/hash for each article to avoid presenting duplicates to the user. This logic doesn't appear to apply across multiple feeds, however.
Steps to Reproduce
Explain what you did to encounter the issue
Subscribe to http://feeds.bbci.co.uk/news/business/rss.xml
Subscribe to http://feeds.bbci.co.uk/news/technology/rss.xml
Sync feeds
Ensure you're selecting on All articles
Scroll to see duplicates (if you don't, just add other BBC feeds, a duplicate is bound to show up)
System Information
News app version: 25.2.0
Nextcloud version: Nextcloud Hub 8 (29.0.0)
Cron type: system cron
PHP version: 8.2.18
Database and version: mysql 10.6.17
Browser and version: Mozilla Firefox Flatpak 134.0.2 (64-bit)
OS and version: Debian GNU/Linux 12 (bookworm) x86_64
Contents of nextcloud/data/nextcloud.log
N/A: This is a logical/UX issue afaik.
Contents of Browser Error Console
Read http://ggnome.com/wiki/Using_The_Browser_Error_Console if you are unsure what to put here
N/A: No logs in console!
Proposal
Would it be feasible to check this hash globally, and see if that hash has already been processed before.
The article should still appear in each individual feed when they're accessed. These kinds of duplicates would only be hidden when selected on a category/tag featuring multiple feeds, or when selected on All articles.
Considerations
At the top of the article, it states from {Source} (from BBC Technology or from BBC Business). If there are duplicate articles, it may be worth thinking how to approach this UI. Can it just use whichever it found first? Should it list all feeds the article appeared in (from BBC Technology and BBC Business)?
The text was updated successfully, but these errors were encountered:
IMPORTANT
Read and tick the following checkbox after you have created the issue or place an x inside the brackets ;)
Explain the Problem
I'm subscribed to multiple news feeds from the BBC:
However, frequently the same article appears in more than one of these feeds.
For example, here is an identical record that appears in both
technology
andbusiness
right now (every field is identical, including the URL):My understanding is that Nextcloud News generates a GUID/hash for each article to avoid presenting duplicates to the user. This logic doesn't appear to apply across multiple feeds, however.
Steps to Reproduce
Explain what you did to encounter the issue
http://feeds.bbci.co.uk/news/business/rss.xml
http://feeds.bbci.co.uk/news/technology/rss.xml
All articles
System Information
Contents of nextcloud/data/nextcloud.log
N/A: This is a logical/UX issue afaik.
Contents of Browser Error Console
Read http://ggnome.com/wiki/Using_The_Browser_Error_Console if you are unsure what to put hereN/A: No logs in console!
Proposal
Would it be feasible to check this hash globally, and see if that hash has already been processed before.
The article should still appear in each individual feed when they're accessed. These kinds of duplicates would only be hidden when selected on a category/tag featuring multiple feeds, or when selected on
All articles
.Considerations
At the top of the article, it states
from {Source}
(from BBC Technology
orfrom BBC Business
). If there are duplicate articles, it may be worth thinking how to approach this UI. Can it just use whichever it found first? Should it list all feeds the article appeared in (from BBC Technology and BBC Business
)?The text was updated successfully, but these errors were encountered: