General Architecture for a Multilingual Information Retrieval system
Project Status: set-up
Start Date: September 2022
End Date: August 2025
Budget (total): 5101.48 K€
Effort: 109.77 PY
Name: Ahmet Sonmezisik
Company: Turkcell Teknoloji
BEIA Consult International S.R.L, Belgium
Code Creator, Czech Republic
Palestra, Czech Republic
Satturn, Czech Republic
ISEP – Instituto Superior de Engenharia do Porto, Portugal
CIC (Consulting Informatico de Cantabria SL), Spain
ARD Group, Turkey
Carbon Consulting, Turkey
Turkcell Teknoloji, Turkey
The project also demonstrates the use of the framework by different use cases such as multilingual text, image and video queries which will help to make more qualified and detailed searches and search within media.
Overall, the GAMIR project is an initiative to unify the efforts of European countries to employ their own NLP and AI based tools in cooperation to create and lower the barrier of implementing IR related applications.
Information retrieval systems are the basis of many state of art automated applications. The main application area of IR systems are search engines.
Today, Google dominates the entire search market by %91 penetration. There are also other important search engines in the Far-East region, like Baidu in China, Yandex in Russia and Naver in South Korea. These search engine companies have some billion USD market caps. On the other hand, their engagement with users make these companies important players in digital markets since they have vast knowledge on user identity.
When we look at Europe, we see some initiatives like Qwant and Seznam that try to encompass nationwide search engine needs. However they are not yet big enough to cater the requirements of different European countries.
Moreover, GAMIR is not only a framework for search engines but also a general framework for other IR related applications like voice assistance, context based queries and other types of applications.