You are here:  Home  >  Data Education  >  Current Article

Amtera Breaks Out Semantic Relatedness API

By   /  September 9, 2013  /  No Comments

Amtera, a Brazilian-based startup whose main solutions are focused on semantic enterprise search and knowledge management, is exploring the API business model.  Its first step in that space is its new semantic relatedness API, which is a piece of its overall pipeline technology for querying Linked Data. Its main products are Esprit for enabling semantic search by meaning rather than keyword, and Sentient, for social media analysis.

The API, the company says, supports applications accessing large-scale distributional knowledge bases, offering word sense disambiguation and classification so developers can deliver knowledge models assured of the correct determination of words.

The API, as it is currently exposed, is for comparing pairs of terms in text, and for disambiguating the words into the right meaning with a minimum level of information. “Star,” for example, can refer in an ontology or taxonomy to a celestial body or to a performer or actor. The word can be disambiguated into its proper sense via its pairing with other information found in surrounding text – if the context for the use of the word “star” involves the term “Angelina Jolie,” the pairing is with the latter meaning of star.

The API’s core operation is to compute the semantic relatedness measure, to allow the systematic determination of the semantic proximity between terms. “This way allows a more flexible way to align ontologies, to compare things in a more flexible and complete way,” says co-founder, partner and CEO João Gabriel Oliveira. “When you create knowledge models based just on ontologies, though they might be extremely correct, they tend to be not that complete. You couldn’t cover all the possibilities. We think eventually, by using both together, you can cope with this gap in completeness.”

The company’s API plans, Oliveira says, involve scaling elastically through the cloud. “Right now,” he says, “we could compute about 300 pairs per second, which is very much….I don’t know any other implementation of this kind of solution, this kind of semantic model, that is that fast.” Because it is a linearly scalable model, Amtera could also increase the throughput, he says.  “The technology itself is highly optimized, so we can elastically increase the scale and easily cope with performance requirements,” he says.

Currently, the API service is available in Portuguese and English versions, but once the company has pinned down its method of validating the model, it can be easily generated in other languages. Expect Spanish, French and German capabilities soon. There is a freemium offering that supports up to 5,000 pairs per day, as well as licensed options.

Oliveria notes that the company also could exploit other aspects of its core technology in the API format, such as relation extraction.



About the author

Jennifer Zaino is a New York-based freelance writer specializing in business and technology journalism. She has been an executive editor at leading technology publications, including InformationWeek, where she spearheaded an award-winning news section, and Network Computing, where she helped develop online content strategies including review exclusives and analyst reports. Her freelance credentials include being a regular contributor of original content to The Semantic Web Blog; acting as a contributing writer to RFID Journal; and serving as executive editor at the Smart Architect Smart Enterprise Exchange group. Her work also has appeared in publications and on web sites including EdTech (K-12 and Higher Ed), Ingram Micro Channel Advisor, The CMO Site, and Federal Computer Week.

You might also like...

James Kobielus

Data Science to Change The World or Scratch an Itch

Read More →