1. Introduction

Phylogeny.fr has been designed to provide a high performance platform that transparently chains programs relevant to phylogenetic analysis in a comprehensive and flexible pipeline. Although phylogenetic aficionados will be able to find most of their favorite tools and run sophisticated analyzes, the primary philosophy of Phylogeny.fr is to assist biologists with no experience in phylogeny in analyzing their data in a robust way.

Note: The Phylogeny.fr platform offers a phylogeny pipeline which can be executed through three main modes, designed to fit your specific expertise level and analysis requirements.

Alternatively, users have the possibility to run the different programs separately in the Tools section.

2. Phylogeny Analysis

2.1 "One Click" mode

This is a "default" mode which proposes a pipeline already set up to run and connect programs recognized for their accuracy and speed to reconstruct a robust phylogenetic tree from a set of sequences. The workflow depends on the chosen execution mode (Fast or Accurate):

  • Mafft for rapid and robust multiple sequence alignment.
  • Optionally ClipKit for alignment curation (trimming unreliable regions).
  • FastTree (Fast mode) or IQ-TREE (Accurate mode) for phylogeny.
What users have to do is just to upload their FASTA file or paste their sequences and pick an execution speed. The system will handle everything else, automatically formatting data between steps and applying scientifically sound default parameters.

You have the ability to toggle the curation step (ClipKit) to eliminate poorly aligned positions and highly divergent regions before the tree inference step. At the end of the analysis, a beautiful, interactive rendition of the generated tree will be available for you to interact with and export.

Default Parameters

For transparency, below is the exhaustive list of the parameters used by the tools behind the scenes during a One Click run:

ParameterDefault Value
Mafft (Alignment)
Alignment strategyDefault (FFT-NS-2)
ClipKit (Alignment Curation)
ModeGappy (removes columns with high gap frequency)
Gaps threshold0.4 (40%)
FastTree (Fast Phylogeny)
Evolutionary ModelDefault (JTT+CAT for proteins, JC+CAT for DNA)
Branch SupportSH-like local supports
IQ-TREE (Accurate Phylogeny)
Model SelectionAuto (ModelFinder)
Branch SupportUltrafast Bootstrap (5000 replicates)
Burn-in iterations200
PhyloTree Tool (Tree Utilities)
RootingMidpoint Rooting

2.2 Advanced mode

The Advanced mode follows the same robust pipeline philosophy as "One Click" mode, utilizing a predefined succession of programs (Mafft, ClipKit, and FastTree). However, unlike One Click mode, it provides you with complete control over the parameters for each step of the analysis.

This mode is designed for intermediate to advanced users who want to fine-tune their phylogenetic reconstruction without dealing with the complex selection of individual tools from scratch.

Tip: If you are unsure about a specific parameter, the Advanced mode forms provide descriptive tooltips for each option to help you make scientifically sound choices.

2.3 "A la carte" mode

The "A la carte" mode provides the ultimate flexibility for building your phylogenetic analysis pipeline. Unlike the predefined workflows of One Click and Advanced modes, this mode allows you to handpick each tool for every step of the analysis, giving you complete control over your methodology.

Ideal for: Researchers who want to compare different tools, test specific methodologies, or have particular requirements that the standard pipelines do not cover.

Two-Phase Workflow

The A la carte mode operates in two distinct phases:

  1. Pipeline Design: Select which tools you want to include in your analysis. A visual pipeline at the top of the page shows your current selection in real-time.
  2. Configuration & Launch: After confirming your tool selection, configure each tool's parameters individually, upload your FASTA input, and submit the analysis.

Available Tools by Step

ToolDescription
Step 1: Multiple Alignment
MUSCLEFast and accurate aligner, ideal for small to medium datasets
T-CoffeeConsistency-based aligner with high accuracy for divergent sequences
ClustalOScalable aligner optimized for large datasets using HMM profiles
MafftVersatile aligner with multiple strategies (FFT-NS, L-INS-i, etc.)
Step 2: Alignment Curation (Optional)
GblocksRemoves poorly aligned regions using a block-selection approach
ClipKitModern trimming tool that preserves phylogenetically informative sites
BMGERemoves highly variable or ambiguous regions based on entropy
Step 3: Phylogenetic Tree
PhyMLMaximum Likelihood inference with model selection support
BioNJFast distance-based method, good for exploratory analyses
MrBayesBayesian inference using MCMC sampling for posterior probabilities
IQTreeState-of-the-art ML with ultrafast bootstrap and ModelFinder
RAxML-NGOptimized ML for large-scale phylogenomic datasets
FastTreeApproximate ML, extremely fast for large alignments
Flexible Selection: You are not required to select a tool for every step. For example, you can run only an alignment tool, or skip the curation step entirely. However, at least one tool must be selected to proceed.

How It Works

  1. Navigate to the A la carte page from the main menu.
  2. Click on the tool tiles to select your preferred tool for each step (Alignment, Curation, Phylogeny). Selected tools are highlighted with a checkmark.
  3. Click "Continue to Configuration" to proceed.
  4. Upload your FASTA file or paste your sequences directly.
  5. Configure each selected tool's parameters using the provided forms.
  6. Optionally enter your email to receive a notification when the analysis completes.
  7. Click "Submit Analysis" to start the pipeline.
Note: Make sure your input data flows correctly through the pipeline. For instance, phylogeny tools expect an alignment as input—if you skip the alignment step, ensure your input file is already a valid multiple sequence alignment.

3. BLAST Explorer

The BLAST Explorer allows you to search for sequences similar to your query within major biological databases. Beyond standard BLAST searches, our platform automatically builds a phylogenetic tree from the top hits, enabling you to explore the evolutionary relationships among homologous sequences interactively.

Unique Feature: Unlike traditional BLAST services, Phylogeny.fr automatically aligns your query with the best hits using Mafft and reconstructs a phylogenetic tree using FastTree—giving you instant evolutionary context.

3.1 Submitting a query

To run a BLAST search, you need to provide a single sequence in FASTA format. You can either upload a file or paste your sequence directly into the text area.

BLAST Programs

Select the appropriate program based on your query and target database types:

ProgramQueryDatabaseUse case
blastpProteinProteinFind homologous proteins
blastnNucleotideNucleotideFind similar DNA/RNA sequences
blastxNucleotideProteinTranslate DNA and search proteins
tblastnProteinNucleotideSearch protein against translated DNA

Available Databases

Choose from a variety of regularly updated sequence databases:

DatabaseTypeDescription
Protein Databases
nr_clusterProteinNCBI non-redundant protein sequences (clustered)
SwissProtProteinCurated, high-quality UniProt entries
UniRef90ProteinUniProt clustered at 90% identity
PDBProteinProtein Data Bank sequences with known 3D structures
RefSeq Viral ProteinProteinNCBI curated viral protein sequences
Nucleotide Databases
Core NucleotideNucleotideNCBI core nucleotide collection
PDBNucleotidePDB nucleotide sequences
RefSeq Viral GenomicNucleotideNCBI curated viral genomic sequences

E-value Threshold

The E-value (Expect value) represents the number of hits you would expect to find by chance. A lower E-value indicates a more significant match:

  • 1.e-5 (default): Good balance between sensitivity and specificity.
  • 1.e-10 to 1.e-30: More stringent, returns only highly confident hits.
  • 0.1 to 1: More permissive, may include distant homologs.
Tip: If your initial search returns no hits, try increasing the E-value threshold or switching to a broader database like nr_cluster.

3.2 Interactive results

Once your BLAST job completes, you are presented with an interactive visualization combining a phylogenetic tree and a powerful floating Toolbox panel for filtering and exporting your results.

Phylogenetic Tree Display

The tree displays your query sequence alongside the top BLAST hits, allowing you to visualize evolutionary relationships at a glance:

  • Query highlighting: Your input sequence is visually emphasized in the tree.
  • Taxonomy coloring: Sequences are colored by taxonomic group (adjustable depth).
  • Interactive selection: Click directly on tree leaves to select or deselect sequences.
  • External links: Click on sequence IDs to access their original database entry.

The Toolbox Panel

The floating Toolbox is your control center for exploring BLAST results. It can be dragged anywhere on screen and offers two distinct modes via tabs:

Tree View Tab

This tab focuses on the top hits displayed in the phylogenetic tree (typically the best-scoring sequences). Use it for precise, small-scale selection:

  • Selection counter: Shows how many sequences are currently selected out of total tree leaves.
  • Quick select buttons: Instantly select all or deselect all sequences in the tree.
  • Bitscore filter pills: Filter sequences by score ranges (<40, 40-50, 50-80, 80-200, >200) with color-coded pills.
  • Taxonomy depth slider: Adjust the taxonomic classification level displayed (1=Kingdom to 8=Species level).
  • Gap-free indicator: Shows the percentage of alignment positions without gaps for your current selection—useful for assessing alignment quality.
All Hits Tab

This tab operates on all BLAST hits, not just those in the tree. Use it for large-scale filtering across the entire result set:

  • E-value threshold: Dynamically filter hits by E-value (from 1e-5 to 1e-100).
  • Histogram filters: Open interactive histograms to select sequences by:
    • Score (bitscore): Filter by alignment score strength.
    • Similarity (% identity): Filter by sequence similarity percentage.
    • Coverage (% query coverage): Filter by how much of your query is covered.
  • Taxonomy filter: Open a hierarchical taxonomy tree to select sequences from specific taxonomic groups (e.g., only Bacteria, only Vertebrates).
  • Reset selection: Clear all filters and start fresh.
Tip: Use the All Hits tab when you need to work with sequences that didn't make it into the tree visualization. The histogram filters let you identify and select sequences based on precise score ranges.

Exporting & Pipeline Integration

Both tabs provide export options for your selected sequences:

  • Download FASTA: Export selected sequences as a FASTA file for external analysis.
  • Send to Pipeline: Directly transfer selected sequences to the One Click or Advanced phylogeny pipelines. The sequences are automatically loaded into the analysis form.
Seamless Workflow: A typical workflow might be: run BLAST → explore the tree → filter by taxonomy or score → send a curated subset directly to the phylogeny pipeline for detailed analysis.

Taxonomy Legend

Below the tree, a dynamic legend displays the taxonomic groups present in your results with their associated colors. The groups shown depend on the taxonomy depth setting in the Toolbox.

4. Tools & Versions

Below is the list of all bioinformatics tools integrated in the platform, along with their current version.

ToolDescriptionVersion
BlastBasic Local Alignment Search Tool2.16.0
MMseqs2Ultra-fast and sensitive sequence search and clusteringsse2
MafftMultiple sequence alignment program for large datasets7.526
MuscleFast and accurate multiple sequence alignment5.3
Clustal OmegaFast multiple sequence alignment1.2.4
TCoffeeAdvanced multiple sequence alignment program13.46.0.919e8c6b
GBlocksAlignment curation — eliminates poorly aligned and divergent regions0.91b
ClipKitAlignment curation using smart-gap trimming2.11.4
BMGEAlignment curation using gap trimming2.0
FastTreeApproximately-maximum-likelihood phylogenetic tree inference2.2.0
IQ-TreeMaximum-likelihood phylogenetic inference with automatic model selection2.4.0
RAxML-NGRandomized Axelerated Maximum Likelihood phylogenetic inference2.0.0
MrBayesBayesian inference of phylogenetic trees3.2.7a
PhyMLMaximum-likelihood phylogenetic inference3.3.20241207
FastphyloFast tools for phylogenetics — distance computation and neighbor-joining1.0.1
BioNJNeighbor-joining phylogenetic inference1.0