Blog Module:Xignite

From ExtractValue Wiki

The Xignite sub-module of the Blog Module covers the functions for displaying and interacting with Xignite News Feeds, including feeds from the Blog Module.

Objectives

To mimic the functionality of Futuresource News, which takes the same or similar data feeds.

  • Receive full text RSS feeds on relevant news
  • Archive this news in a search engine friendly manner
  • Allow news articles to be easily interlinked
  • Enable all levels of interactivity as with the Blog Module (commenting, RSS subscribing, etc.)

Functional Specs

Version 1.0

Retrieval & Storage

Group   | URL                                                                 | Article tags
Reuters | http://feeds.reuters.com/reuters/businessNews?format=xml            | business news, news feed
Reuters | http://feeds.reuters.com/reuters/features/personalfinance?format=xml | lifestyle, money, news feed


  1. Create Group: Reuters
    1. Group Type: News Service
    2. Originator: Correlo
    3. No Products
    4. No Admin
  2. Retrieve an XML feed of market headlines. Parse for:
    1. title
    2. description
    3. pubDate
    4. feedburner:origLink
  3. Generate the URL of http://domain/news/reuters/article-title
    1. Replace spaces with a hyphen (-), per the URL Schema
    2. If there are duplicate article titles (there really shouldn't be), append (1), (2), (3), ... to the title
  4. Use feedburner:origLink in GetMarketNewsDetails to retrieve the full text article for every headline. Do not retrieve the same article twice; that would be an unnecessary hit on Xignite's API.
    1. Parse for and save <content> from GetMarketNewsDetails
  5. Display the news feed on http://domain/news/ and /groupname/ pages
    1. With title, description, and pubDate
    2. Link to the article at ../news/groupname/article-title
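
Steps 2 and 3 above can be sketched roughly as follows. This is a minimal illustration, not the module's actual implementation: the function names parse_headlines, make_slug and article_url are hypothetical, the feed is assumed to be RSS 2.0 with the standard FeedBurner namespace, and the title's letter case is left unchanged because the spec only mentions replacing spaces.

```python
import xml.etree.ElementTree as ET

# Namespace FeedBurner uses for the origLink element.
FEEDBURNER_NS = "http://rssnamespace.org/feedburner/ext/1.0"


def parse_headlines(feed_xml):
    """Return one dict per <item> holding the four fields named in step 2."""
    root = ET.fromstring(feed_xml)
    headlines = []
    for item in root.iter("item"):
        headlines.append({
            "title": item.findtext("title", default=""),
            "description": item.findtext("description", default=""),
            "pubDate": item.findtext("pubDate", default=""),
            "origLink": item.findtext("{%s}origLink" % FEEDBURNER_NS, default=""),
        })
    return headlines


def make_slug(title, taken):
    """Replace spaces with hyphens; append (1), (2), ... for duplicate titles.

    `taken` is a set of slugs already in use (hypothetical stand-in for a
    database lookup). The chosen slug is added to it.
    """
    base = title.strip().replace(" ", "-")
    slug = base
    counter = 1
    while slug in taken:
        slug = "%s(%d)" % (base, counter)
        counter += 1
    taken.add(slug)
    return slug


def article_url(domain, group, slug):
    """Build http://domain/news/groupname/article-title."""
    return "http://%s/news/%s/%s" % (domain, group, slug)
```

A caller would parse each feed from the table, call make_slug for every headline, and store the resulting URL alongside the title, description, and pubDate.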

Proprietary RSS Feed

  1. Create an RSS feed for the news of each Group
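
One possible shape for the per-Group feed is sketched below. It assumes plain RSS 2.0 output and that each stored article is a dict with title, slug, description, and pubDate keys; the function name build_group_feed and the channel title and description wording are illustrative assumptions, not part of the spec.

```python
import xml.etree.ElementTree as ET


def build_group_feed(domain, group, articles):
    """Return an RSS 2.0 document (as a string) for one Group's news."""
    rss = ET.Element("rss", version="2.0")
    channel = ET.SubElement(rss, "channel")
    ET.SubElement(channel, "title").text = "%s news" % group
    ET.SubElement(channel, "link").text = "http://%s/news/%s/" % (domain, group)
    ET.SubElement(channel, "description").text = "News articles for the %s Group" % group
    for article in articles:
        item = ET.SubElement(channel, "item")
        ET.SubElement(item, "title").text = article["title"]
        # Link to the archived article on this site, per the URL Schema.
        ET.SubElement(item, "link").text = "http://%s/news/%s/%s" % (
            domain, group, article["slug"])
        ET.SubElement(item, "description").text = article["description"]
        ET.SubElement(item, "pubDate").text = article["pubDate"]
    return ET.tostring(rss, encoding="unicode")
```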

Version 2.0

Austinguu 18:19, 30 September 2008 (UTC) says:
Sybrant, please do not quote or build Versions 2.0 and 3.0 yet
We have not finished spec'ing them.


Selective Article Storage

The objective is to hit the Xignite API only when needed: on URL visit and on hyperlink click. That way, we can pull more feeds without having to worry about the cost of licensing unnecessarily large amounts of data. Additionally, the application must make sure that the full text is not retrieved for bots.

  1. On RSS or XML feed retrieval (not full text), generate the appropriate News article link per the URL Schema
  2. When the URL is visited (http://domain/news/reuters/article-title...
  3. Check to see if the full text exists in the database
    1. Check the user agent to see if it is a bot (like Google). If it is, then only display the information currently in the database.
    2. If the user agent is not a bot and the full text does not exist in the database, then retrieve the full text and save it.
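
The stated objective (call the Xignite API only on a real visit, never for bots) could be sketched like this. Everything here is an assumption for illustration: the database is a plain dict keyed by article slug, fetch_full_text is a hypothetical callable wrapping GetMarketNewsDetails, and the bot check is a naive substring match on the user agent rather than a complete crawler list.

```python
# Naive markers; a real deployment would need a maintained crawler list.
BOT_MARKERS = ("bot", "crawler", "spider", "slurp")


def is_bot(user_agent):
    """Return True if the user agent string looks like a crawler."""
    ua = (user_agent or "").lower()
    return any(marker in ua for marker in BOT_MARKERS)


def get_article(slug, user_agent, database, fetch_full_text):
    """Return the stored record, fetching the full text only when needed."""
    record = database[slug]
    if record.get("full_text") is not None:
        # Already stored: never hit the API again for this article.
        return record
    if is_bot(user_agent):
        # Bots only see what is currently in the database.
        return record
    # Human visitor and no stored text: one API call, then save the result.
    record["full_text"] = fetch_full_text(record["origLink"])
    return record
```

Passing the fetch function in as a parameter keeps the API call in one place, which makes it easier to confirm that each article is requested at most once.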

Version 3.0

Copy and Paste Feed

  1. Build the ability for a Group to paste the URL of an RSS feed into a form
  2. This RSS feed will then be displayed under the Group News page
