What was the main performance bottleneck in the software?

The clusterization logic in FASTAptamer, particularly the Levenshtein algorithm implemented in Perl, consumed most of the processing time.

How did Azati optimize the software?

Azati rewrote the Levenshtein algorithm in C++ and optimized the clusterization step, improving execution speed and memory handling.

What was the performance improvement?

The Levenshtein algorithm ran 1,000x faster, and the total software execution time was reduced 80x, from 48 hours to 30.5 minutes.

Was the optimization shared with the community?

Yes, the optimized algorithm was submitted to FASTAptamer’s official repository and included in version 1.0.12, benefiting all users.

80-Fold Software Performance Improvement

Azati significantly accelerated the client’s DNA sequence processing software by identifying and optimizing a critical bottleneck in the FASTAptamer toolkit. The team rewrote the core clusterization logic from Perl to C++, achieving an 80x reduction in total execution time and a 1,000x improvement for the embedded Levenshtein algorithm.

Discuss your project

80x

overall software performance improvement

1,000x

Levenshtein algorithm execution speedup

30.5 min

end-to-end processing time after optimization

Technologies used

C++

Perl

Motivation

The client needed a dramatic acceleration of their DNA sequencing pipeline, which processed vast amounts of biological data. Manual research workflows were slowed by a software bottleneck, delaying experiments and reducing lab productivity. The goal was to shorten processing time while maintaining result accuracy, enabling faster research cycles and timely delivery of insights.

Main challenges

The client’s DNA sequencing software took approximately 48 hours to process a dataset because the FASTAptamer toolkit had a major performance bottleneck. This delay disrupted research timelines and impacted productivity. Azati analyzed the pipeline, pinpointed the slowest steps, and proposed performance engineering solutions to optimize them using low-level programming techniques.

The clusterization step relied on the Levenshtein algorithm implemented in Perl, which was inefficient for high-throughput data. Azati proposed rewriting this logic in C++ to exploit faster execution, better memory handling, and native compilation advantages, drastically reducing processing time while maintaining accuracy.

Our approach

Pipeline Analysis

Analyzed the client's DNA sequencing pipeline to locate performance bottlenecks.

Bottleneck Identification

Determined that the clusterization program in FASTAptamer, particularly the Levenshtein calculation, consumed the majority of execution time.

Benchmarking and Root Cause Analysis

Benchmarked the Perl implementation and confirmed inefficiencies due to language limitations in high-throughput operations.

Algorithm Rewriting in C++

Rewrote the Levenshtein algorithm in C++ to enhance execution speed and memory efficiency.

Integration and Validation

Integrated the optimized C++ algorithm into the client’s pipeline and validated results to ensure consistency and accuracy.

Open Source Contribution

Submitted the improved algorithm to the official FASTAptamer repository, which was merged in version 1.0.12, benefiting the global bioinformatics community.

Facing the same challenge?

Bring your complexity. We'll bring the plan. Select a convenient slot to start a conversation with our experts.

Schedule a call

Solution

Optimized Clusterization Logic

The original Levenshtein algorithm in Perl was a major performance bottleneck. Azati rewrote it in C++ to leverage efficient memory management and faster computation, allowing the software to process sequences thousands of times faster without changing the output results.

Key capabilities:

High-speed Levenshtein calculation
Efficient memory management
Support for high-throughput sequence processing
Seamless integration with existing pipeline

Pipeline Acceleration

Beyond the algorithm rewrite, Azati optimized the full DNA sequencing workflow, removing unnecessary delays and improving data handling across modules. This reduced total execution time for datasets from 48 hours to just 30.5 minutes, massively increasing research throughput and lab efficiency.

Key capabilities:

80x overall pipeline speedup
1,000x algorithm execution improvement
Maintains result accuracy
Significant reduction in research wait times

Open Source Integration

The optimized Levenshtein algorithm was submitted to FASTAptamer’s official repository and merged in the subsequent release. This not only improved the client’s performance but also contributed to the wider bioinformatics community, enabling all users to benefit from faster DNA sequence analysis.

Key capabilities:

Contribution to official toolkit
Ensures reproducibility for all users
Supports collaborative development
Widespread adoption of performance improvements

Results & business impact

Faster Data Processing

Reduced total runtime from 48 hours to 30.5 minutes, drastically improving research productivity.

Validated Accuracy

Output results remained consistent, ensuring scientific integrity and reproducibility.

Community Benefit

Optimization accepted into the mainstream toolset, benefiting all FASTAptamer users worldwide.

Last updated

2026-05-22

Got a job for Azati? Let’s talk business!

Full Name^*

Email^*

Upload additional information or RFP

Browse files

Your request^*

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

What's next?

1. Tell Us Your Story

Describe your project. We come back within 24 hours with team availability and a rough plan. NDA on request before the first call.
2. Get Your Roadmap

Receive a detailed proposal with scope, team composition, timeline, and costs tailored to your goals.
3. Start Building

Azati aligns on details, finalize terms, and launch your project with full transparency.

80-Fold Software Performance Improvement

Technologies used

Motivation

Main challenges

FASTAptamer Performance Limits Speed

Inefficient Sequence Clustering in Perl