Please note that this Challenge is no longer open. It has been awarded and is no longer accepting new submissions.
Thomson Reuters Eikon Text Tagging Challenge
STATUS: Awarded
Active Solvers: 433
Posted: May 23 2014
Challenge ID: 9933333

Investors are flooded every day with thousands of news items from many different sources (blogs, feeds, internal reports, news services), yet only a small number of those items may be of interest to a particular investor. Filtering these news items for companies or organizations of interest is becoming increasingly important as the number and variety of sources expand, yet it is also becoming more difficult because many of these items are not explicitly tagged with the relevant companies or organizations. The Seeker, Thomson Reuters, is searching for an algorithm that accurately tags incoming news items by relevance to the companies or organizations mentioned within them.

This is a Reduction-to-Practice Challenge that requires written documentation and delivery of source code implementing an algorithm that solves the problem. Additionally, because this is a Prodigy Challenge, a real-time online scoring utility and leaderboard will be available to track Solver algorithm performance.

This Challenge has a special award structure with awards of $20,000, $10,000, and $5,000 for 1st, 2nd, and 3rd place, respectively, for granting the Seeker a non-exclusive license to practice the solutions. In addition, the Seeker may, at its sole discretion, award an additional $20,000 (“Additional Award”) to one or more of the submissions awarded a Licensing Award to obtain ownership of the solution. Awards will be based on the Seeker’s determination of solution performance using a reserved independent validation set.


Thomson Reuters is seeking an algorithm to accurately tag, by relevance per company or organization, incoming news items from a variety of sources such as blogs, feeds, internal reports, and/or news services.  The Challenge involves tagging items by the companies or organizations relevant to the item while avoiding false positive tags – a story about the apple crop in Washington State should not be tagged for Apple, Inc. or Washington State University.  This is a computational Challenge and you will be provided a training data set of 2000 stories with correct tags for algorithm development and a test data set of 2000 different stories without tags against which you will run your algorithm and upload the results for online scoring.  A real-time online leaderboard will keep Solvers up-to-date on the current high scores for the Challenge.
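As a rough illustration of the task described above, the following Python sketch tags a story with organizations and suppresses the kind of false positive mentioned in the apple-crop example. Everything in it is an assumption made for illustration: the alias table, the context cue words, the function names `tag_story` and `micro_f1`, and the choice of micro-averaged precision, recall, and F1 as a score. The Challenge's actual data format and scoring metric are not stated in this description, and a competitive solution would likely need a far richer model.

```python
import re

# Hypothetical alias table: organization -> surface forms that may name it.
ALIASES = {
    "Apple Inc.": ["Apple"],
    "Washington State University": ["Washington State University", "WSU"],
}

# Hypothetical context cues: an ambiguous alias only counts when at least
# one cue word also appears somewhere in the story.
CONTEXT_CUES = {
    "Apple Inc.": {"iphone", "ipad", "mac", "shares", "ceo", "nasdaq"},
}


def tag_story(text):
    """Return the set of organizations judged relevant to one story."""
    words = set(re.findall(r"[a-z0-9]+", text.lower()))
    tags = set()
    for org, aliases in ALIASES.items():
        # Case-sensitive whole-word match on any alias.
        mentioned = any(
            re.search(r"\b" + re.escape(alias) + r"\b", text) for alias in aliases
        )
        if not mentioned:
            continue
        cues = CONTEXT_CUES.get(org)
        # Skip ambiguous names that appear without any supporting context.
        if cues is not None and not (cues & words):
            continue
        tags.add(org)
    return tags


def micro_f1(predicted, gold):
    """Micro-averaged precision, recall, and F1 over per-story tag sets."""
    tp = sum(len(p & g) for p, g in zip(predicted, gold))
    fp = sum(len(p - g) for p, g in zip(predicted, gold))
    fn = sum(len(g - p) for p, g in zip(predicted, gold))
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


# Tiny made-up "training set": stories paired with their correct tags.
stories = [
    "Apple growers in Washington State expect a record apple crop.",
    "Apple shares rose after the iPhone launch.",
    "Researchers at WSU published a new study.",
]
gold = [set(), {"Apple Inc."}, {"Washington State University"}]

predicted = [tag_story(s) for s in stories]
scores = micro_f1(predicted, gold)
```

The first story mentions "Apple" and "Washington State" but receives no tags, because no cue word supports the company reading and the full university name never appears. Scoring against labeled stories in this way mirrors the train-then-upload loop the Challenge describes, though the real leaderboard may use a different measure.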

Final submissions to the Challenge should include the following:

  1. A detailed description of the proposed algorithm, and how it addresses each Technical Requirement presented in the Detailed Description of the Challenge.  The project documentation should include a well-articulated rationale for the algorithm.
  2. Source code implementing the proposed algorithm. Include all dependencies, packages, databases, documentation, and information to generate test results by the Seeker.
  3. Full Documentation for the algorithm, including the following: (i) Instructions on installation and execution of the code. (ii) Description of any external dependencies. External public datasets and external open source code are allowed – be sure to indicate them and include any necessary files and instructions. (iii) Indication of the target platform (see Technical Requirements above for technical details).

This Challenge has a special award structure with awards of $20,000, $10,000, and $5,000 (“Licensing Awards”) for 1st, 2nd, and 3rd place, respectively, for granting the Seeker a non-exclusive license to practice the solutions.  In addition, the Seeker may, at its sole discretion, award an additional $20,000 (“Additional Award”) to one or more of the submissions awarded a Licensing Award to obtain ownership of the solution.

The award is contingent upon evaluation and validation of the submitted Solutions by the Seeker. During the evaluation period, the Seeker will validate the performance (accuracy and speed) of top-scoring submissions using additional data similar to the training data provided in the Challenge.

To receive a Licensing Award, the Solvers will not have to transfer their exclusive IP rights to the Seeker. Instead, they will grant the Seeker a non-exclusive license to practice their solutions. To receive an Additional Award, the Solvers will have to transfer to the Seeker their exclusive Intellectual Property (IP) rights to the solution.


NOTE: Employees of Thomson Reuters are not eligible to participate in this Challenge.



Thomson Reuters is the world's leading source of intelligent information for businesses and professionals. We combine industry expertise with innovative technology to deliver critical information to leading decision makers in the financial and risk, legal, tax and accounting, intellectual property and science and media markets, powered by the world's most trusted news organization.

Thomson Reuters Eikon is a powerful and intuitive next-generation solution for consuming real-time and historical data, enabling financial markets transactions and connecting with the financial markets community. Its award-winning news, analytics and data visualization tools help its users make more efficient trading and investment decisions across asset classes and instruments including commodities, derivatives, equities, fixed income and foreign exchange. Thomson Reuters Eikon is a leading desktop and mobile solution that is connected, informed, intelligent and open, and provides access to a messaging community of over 210,000 financial professionals.

Put Your Knowledge in Motion

Intelligent information starts with talent. Businesses and professionals all over the globe rely on the people of Thomson Reuters to transform knowledge into action, so they can shape outcomes on the world stage. In return, we believe careers shouldn't be confined. The breadth and global reach of our business offers virtually unlimited opportunities to shape a career path that matches the contours of your talents, interests and goals.

For more information about our Financial & Risk business and what we do, please visit our careers page.

What is InnoCentive?
InnoCentive is the global innovation marketplace where creative minds solve some of the world's most important problems for cash awards up to $1 million. Commercial, governmental and humanitarian organizations engage with InnoCentive to solve problems that can impact humankind in areas ranging from the environment to medical advancements.

What is an RTP Challenge?

An InnoCentive RTP (Reduction to Practice) Challenge calls for a prototype that proves an idea, and is similar to an InnoCentive Theoretical Challenge in its high level of detail. However, an RTP requires the Solver to submit a validated solution, either in the form of original data or a physical sample. The Seeker is also allowed to test the proposed solution. For details about treatment of IP rights, please see the Challenge-Specific Agreement.
