AiRo Provides Citation Consolidation Solution for more than a Million Bio Pharma Products

Client Overview:

The client is globally the largest repository that manages standard reference microorganisms, cell lines, and other materials for research and development purposes.

The client being a leading provider of biological materials, and standards, wanted to collect the citation details for its million+ products from different sources and link them in the organization’s portal.

Problem:

Client required a solution to consolidate citation details for easy access

The client needed a solution that would consolidate the citation details for the client’s products and make them available on their portal for easy access. The objective of the initiative was to develop a web crawling solution that would extract the citation details for client’s products from various sources, such as PubMed and EuropePMC, on a weekly basis. The solution would also extract author profiles and concerned hyperlinks and download and store research articles wherever applicable.

Solution:

AiRo proposed a AI-enabled next-gen citation solution for the client’s portal

AiRo proposed to adopt Artificial intelligence by combining multiple AI technologies to create a solution that would yield the best possible benefits for clients. The proposed solution architecture included several layers that would extract information from different sources, download it, convert it to a common format, and insert it into a database.

The layers would use bespoke Python programming and Robotic Process Automation (RPA) to implement the web crawling solution. The proposed methodology included downloading data for each source, extracting information for each source, converting it to a common format, and inserting it into a database.

AiRo wins client confidence with its solution architecture

The solution architecture proposed by AiRo included several layers, such as the extraction of information, download of data, conversion to a common format, and database operations. The layers would use Python scripts and SQL Server DB operations to implement the web crawling solution.

The solution would use regex-based search on the data to extract the information and custom logic to convert it to a common format. The solution would also include pre-processing and cleaning of data where required. The web crawler would perform API calls with input query and date range and split the date range into months to download data iteratively through API calls with a gap of 3 seconds as suggested by some sources of information like PubMed and EuropePMC.

The client recorded significant gains post deployment

AiRo managed quality, risk and issues, change management, and governance, reporting, and communication plan. The web crawling solution was implemented in three environments: Development, Test, and Production.

The web crawling solution successfully fetched the citation details for client’s products from various sources on a weekly basis and uploaded them back to the client’s portal. The web crawler consolidated the citation details for client’s products and made them available on the client’s portal for easy access by users.

Latest honors

Advanced AI

Foundational AI

Our AI partners

Our cloud partners

Our data partners

Our cybersecurity partners

Immerse yourself in the Airo vibe

Explore how we make the future possible at Airo.

We are proud to be a certified Great Place to Work®.

Job openings

Airo in action

From our desk

Making global headlines

Let’s meet