Posts Tagged ‘News Aggregation’

Trend-Indicator: “Unilateral Reporting”

Thursday, March 25th, 2010

The Following directed graph shows the “top 5″ countries in the news (grouped by countries).

The query was done for all stored articles since January 2010 until now.

The edges are labelled with the percentage share of all articles for the origin country:

Q1 Country Interest Crunched

Q1 Country Interest

You can find the original picture here.

Votes on Kungle.de

Friday, October 30th, 2009

A short summary about the new vote feature.

Last Week Results

Total Votes: 623

Votes by Topic:

TopicVotes
Science183
Politic101
Technology95
Economy80
Boulevard78
Entertainment50
Religion24
Sport9
Adult3

Pie Chart:

VotesOct

Why should you vote?

Because with all collected votes it will be possible to calculate better results for following matters:

  • Offer new subtopics
    For example visitors are especially interested in Science/Health, Entertainment/Bollywood or Politic/War. Subtopics will be implemented in future versions of  the application.
  • Forecast the period of time a news entry is relevant
    First indications are, that Boulevard-News have the longest lasting relevance.
  • Estimate publisher/editor reputations
    More votes mean a higher reputation for a publisher.

Vote Guidance

You cast a vote by pressing the check-mark on the right-hand side of the news-entry.

unselected

You have ten votes in total.

Your Privace

Kungle.de will not store any personal information about your visit.

In Detail

Kungles web server like any other web server on the internet stores your IP in a log file. Independent from this logfile  Kungles application server stores your selection and votes temporarily in memory.  It is very difficult to connect these two information sources to generate an individualized user profile.

Finally the database counts the votes passed by the application server and logs theses events independent from the two other logs.

architecture

I’m not interested in individual user behavior. 

Recent events showed that you should be very careful with your personal data:

http://www.kungle.de/Trend/entry/275800

http://www.kungle.de/Trend/entry/275766

http://www.kungle.de/Trend/entry/275526

Therefore I see the necessity to inform you about Kungles information processing.

This Week on Kungle.de: Nobel Prizes, Riots in Pakistan and the “Balloon Boy”

Friday, October 16th, 2009

Three issues with hundreds of similar news publications blocked the front page of Kungle.de. Each publication is interesting and informative by itself  but together they are hiding other noteworthy information.

I concluded that it was about time to build a new subsystem to reduce the amount of identical information. You can still find all articles via the new “related link”.

The new Subsystem “IssueMerger“ now merges  news with similar content. The older news entries are the more likely  they are consolidated to one issue.

For this, I defined a function to calculate the proximity of two entries. (The Result is 1 if two news entries  are identical and 0 if they completely different.)

It is necessary to  build a complete “News Topology” (A Matrix with up to 1.5 million elements) which defines the proximities of all entry combinations.

The calculation for all topics requires up to 40 hours. The Algorithm itself was coded in 80 lines of scala.

You can find a calculated result here:
http://www.kungle.de/Trend/entry/220033

Update 1: In comparison this merge was hand made:

http://www.kungle.de/Trend/entry/225189

Technical Trouble

Friday, September 18th, 2009

Hacker and Botnets had been very alive this week. Several night-shifts were necessary to keep kungle alive. However some drop outs were unavoidable.

Features:

The development of  the translation feature has begun. The results are adequate so far.

Kungle.de as Pie Chart / New Features

Friday, September 11th, 2009

Some fresh statistics from my database:

New: I’m now following political, cultural and news blogs.

If you want to participate create your own free blog (for example here wordpress.com or here www.blogger.com) and leave me a note or comment with your address of your website.

If you like, use your native language. I will add a new feature  “automatic translation” in the next few days.

Kungle.de News

Friday, August 14th, 2009

This and next week I’m heavily loaded with business & finance. Therefore the Kungle.de development is halted.  This is really distressing for me because I have planned to add the feature “related images” to my front page.

Another short term goal is the improvement of Kungle’s ability to read additional news sources. For example Mongolia, parts of Africa and South America aren’t accessible at the moment.

Long term goals (Next 30 Days)

  • I will start to present additional data.  Every News Entry receives an ID card with additional information.  Don’t panic! The direct link to the article remains.
  • The next phase of topic identification will be released. With enough collected indicators it is possible to use advanced algorithms.

Of course you can send me your comments and critics at any time to support@kungle.de or leave a comment here.