Not accurately identifying itself critical
Sometimes feed apps provide very little or inaccurate information about themselves to websites when requesting their content, which is what this application is doing. This is usually done to try to blend in with other traffic on the website, stay less noticeable, and decrease the chance websites will classify it as a bad bot trying to bypass a website's restrictions.
Why it's a problemFlying under the radar may be beneficial at times, but when done by a feed reader, it can look very suspicious. So websites are likely to block the application from obtaining feed content for not being identifiable.
What it means for usersIf a website blocks this application for being unidentifiable, which is likely, feeds used in the app won't work.
How to fix itWhen requesting feeds from websites, this app should properly identify itself using a unique User Agent header, with sufficient information so that websites always know what it is.
Requesting feed content from the wrong location critical
It's common for feed applications to grab content from some unconventional location on a website to generate a feed because the website doesn't provide it. However, even though the site already tells this app where feeds are located, the app is still attempting to extract content from other areas on the website that clearly aren't feeds.
Why it's a problemWhen a feed app doesn't request content from the right location or tries getting content from areas on a site not designated for feed consumption, the activity looks suspicious. This is likely to cause website owners to block this app from accessing its content.
What it means for usersFeed content obtained from the wrong location can be unpredictable and may not display correctly in the app. If websites block this app because of the behavior, which is likely, feeds won't work at all.
How to fix itWhen the app visits a webpage to retrieve feed content, it should:
-
Get the website's feed location from the
autodiscovery
link
element of the page and use that location for all subsequent feed requests - Update any attempts by its users to subscribe to the incorrect location in the application to the correct new location without making any further requests to the website for this info
- Ensure the request isn't made to a location on the website that has been explicitly disallowed in its robots.txt file