Blog Module:Xignite
From ExtractValue Wiki
The Xignite sub-module of the Blog Module refers to the functions related to displaying and interacting with Xignite News Feeds, including feeds from the Blog Module.
Contents |
Objectives
To mimic the functionality of Futuresource News, which takes the same or similar data feeds.
- Receive full text RSS feeds on relevant news
- Archive this news in a search engine friendly manner
- Allow this news article to be easily interlinked
- Enable all levels of interactivity as with the Blog Module (commenting, RSS subscribing, etc.)
Functional Specs
- Xignite Login
- User: xignite@correlo.com
- Pass: xignite
- News feeds are stored under Groups
- All URLs must follow the URL Schema for News Feeds
- See Talk:News Module for the source code of sample XML feeds
Version 1.0
Retrieval & Storage
| Group | URL | Article tags |
| Reuters | http://feeds.reuters.com/reuters/businessNews?format=xml | business news, news feed |
| Reuters | http://feeds.reuters.com/reuters/features/personalfinance?format=xml | lifestyle, money, news feed |
- Create Group: Reuters
- Group Type: News Service
- Originator: Correlo
- No Products
- No Admin
- Retrieve a XML feed of market headlines. Parse for:
- title
- description
- pubDate
- feedburner:origLink
- Generate the URL of http://domain/news/reuters/article-title
- Replace spaces with a hyphen (-), per the URL Schema
- If there are duplicate article titles (there really shouldn't be), add (1), (2), (3), ... after it
- Use feedburner:origLink in GetMarketNewsDetails to retrieve full text articles for every headline (obviously, do not pull the same feed twice. That would be an unnecessary hit of Xignite's API)
- Parse for and save <content> from GetMarketNewsDetails:
- Display the news feed on http://domain/news/ and /groupname/ pages
- With title, description, and pubDate
- Link to the article at ../news/groupname/article-title
Proprietary RSS Feed
- Create a RSS feed for the news of each Group
Version 2.0
Austinguu 18:19, 30 September 2008 (UTC) says: Sybrant, please do not quote or build Versions 2.0 and 3.0 yet We have not finished spec'ing them.
Selective Article Storage
The objective is to hit the Xignite API only when needed: on URL visit and on hyperlink click. That way, we can pull more feeds without having to worry about the cost of licensing the data unnecessarily high amounts of data. Additionally, the application needs to make sure that the full text is not retrieved for bots as well.
- On RSS or XML feed retrieval (not full text), generate the appropriate News article link per the URL Schema
- When the URL is visited (http://domain/news/reuters/article-title...
- Check to see if the full text exists in the database
- Check the user agent to see if it is a bot (like Google). If it is, then only display the information currently in the database.
- If the user agent is not a bot and the full text does not exist in the database, then do not retrieve the full text.
Version 3.0
Copy and Paste Feed
- Build the ability for a Group to paste the URL of a RSS feed into a form
- This RSS feed will then be displayed under the Group News page
