Genetic Analysis Tool

Development of an online data source for sequence information, designed to meet scientists' and researchers' IP search needs, including patentability, Freedom-to-Operate (FTO), patent infringement, validity, and business intelligence.

Discuss an idea

All Technologies Used

Ruby
Ruby
PostgreSQL
PostgreSQL
JavaScript
JavaScript
Solr
Solr

Motivation

The goal was to create a comprehensive and accessible IP Research Portal that addresses the needs of scientists and researchers in finding genetic sequence information, ensuring accurate patentability analysis, and offering advanced data filtering and reporting tools.

Main Challenges

Challenge 1
Incomplete Search Results

Intellectual property experts state that about 80% of the information published in a patent document is not available anywhere else, and workflows often led to incomplete or overwhelming search results. Azati addressed this by developing a centralized, comprehensive database that integrates various patent sequences, ensuring thorough and accurate search results.

Challenge 2
Inefficient Data Analysis

The time taken for results analysis inhibited the IP sequence search process, and the overwhelming volume of results slowed down the ability to sift through data efficiently. Azati streamlined this process by implementing advanced search algorithms and filtering tools, drastically reducing analysis time and improving efficiency.

Challenge 3
Disparate and Expensive Resources

Multiple databases had to be accessed, creating slow, inefficient, and expensive workflows for researchers and experts, with difficulties in sharing and reporting results. We solved this by consolidating data into a single, user-friendly portal, offering easy access to relevant information and advanced reporting capabilities.

Key Features

  • Patent Sequence Database: The portal includes a comprehensive database with sequences and related data such as organism names, sequence length, and modification tables.
  • Advanced Search Algorithms: Users can search using BLAST, Smith-Waterman, Multiple Sequence Search, or MOTIF, offering flexibility and accuracy in patent sequence searches.
  • Data Filtering and Reporting: Extensive data filtering capabilities, advanced reporting features, and export options make it easy for users to analyze and present their results.
  • Cloud-based Infrastructure: The cloud-based solution ensures scalability, fast data processing, and smooth user experience even as the amount of data grows.

Our Approach

Centralized Data Access
We developed the SequenceBase IP Research Portal, a centralized, easy-to-use platform for accessing genetic sequences from published applications and patents dating back to 1982.
Comprehensive Search Tools
The portal includes multiple search algorithms like BLAST, Smith-Waterman, Multiple Sequence Search, and MOTIF, ensuring thorough and flexible IP sequence searches.
Cloud and Big Data Solutions
We implemented cloud and distributed processing technologies, allowing fast data delivery and ensuring the system can handle the increasing volume of data efficiently.
Data Updates and Speed
Data updates occur every 24 hours, with a big data strategy that ensures blazing fast processing and same-day data delivery to clients.

Project Impact

Improved Search Efficiency: The centralized portal and advanced search tools significantly reduced the time researchers spent on finding and analyzing patent sequences.

Faster Data Delivery: With a cloud-based, big data strategy, SequenceBase provided same-day data delivery to clients, improving efficiency and timeliness.

Better Research Capabilities: The portal effectively addressed the researchers’ needs for comprehensive and accessible IP sequence data, enhancing their ability to conduct valuable genetic research.

Ready To Get Started

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.