Yahoo! has opened up their content analysis technology to web developers. According to the company’s blog, “The newly launched Yahoo! Content Analysis service replaces Yahoo!’s popular Term Extraction service and now provides advanced content analysis on either text or a URL, leverages Yahoo!’s state of the art machine-learned ranking (MLR) technology to extract key terms from the content, and, more importantly, to rank them based on their overall importance to the content. The output you receive contains the keywords and their ranks along with other actionable metadata.”
It continues, “On top of entity extraction and ranking, developers need to know whether key terms correspond to objects with existing rich metadata. Having this entity/object connection allows for the creation of highly engaging user experiences. The Y! Content Analysis output provides related Wikipedia IDs for key terms when they can be confidently identified. This enables interoperability with linked data on the semantic Web.”
The post adds, “Finally, sometimes you’re interested in the whole document in addition to its key terms. We provide classification for your content as well. We’ll map your content or URL to our deep and descriptive taxonomy. Y! Content Analysis supports US English and Chinese languages for entity detection and ranking. Classification supports only English and more languages will be added gradually.”
Image: Courtesy Yahoo!