Releases · exomiser/Exomiser

29 Feb 14:00

julesjacobsen

14.0.0

ffb2da1

New Java version, new database format, smaller data downloads, more ACMG categories, better reporting... Latest

Latest

Minimum Java version is now Java 17
Update database format REQUIRES DATABASE VERSION 2402 - these are significantly smaller than the previous versions (~50-60% of previous size) See the GitHub discussions section for details.
Added new GeneBlacklistFilter #457
Add new ClinVar conflicting evidence counts in HTML output #535
Added PS1, PM1, PM5 categories to ACMG assignments
Altered reporting of InheritanceModeFilter to state that the number shown refers to variants rather than genes.
Updated gene constraints to use gnomad v4.0 data.
TSV genes, TSV variants and VCF outputs will only write to a single file where the possible modes of inheritances are now shown together rather than split across separate files.
Fix for issue #531 where the priorityScoreFilter and regulatoryFeatureFilter pass/fail counts were not displayed in the HTML.
Fix for issue #534 where variant frequency and/or pathogenicity annotations are missing in certain run configurations.
Fix for issue #541 where logging to /tmp/spring.log causes clashes in shared user environments.
TSV output column CLINVAR_ALLELE_ID has been changed to CLINVAR_VARIANT_ID to allow easier reference to ClinVar variants.

Full Changelog: 13.3.0...14.0.0

Assets 5

0 Join discussion

26 Oct 10:30

julesjacobsen

13.3.0

48875d2

MT codon tables and Bayesian ACMG

Updated Jannovar version to 0.41 to fix incorrect MT codon table usage #521
Downgraded PM2 - PM2_Supporting for variants lacking frequency information #502.
Updated Acgs2020Classifier and Acmg2015Classifier to allow for PVS1 and PM2_Supporting to be sufficient to trigger LIKELY_PATHOGENIC
Updated AcmgEvidence to fit a Bayesian points-based system #514
Removed ASJ, FIN, OTH ExAC and gnomAD populations from presets and examples #513.
Fix for regression causing <INV> variants to be incorrectly down-ranked
Fix for issue #486 where VCF output includes whitespace in INFO field.
Logs will now display elapsed time correctly if an analysis runs over an hour (!).

Full Changelog: 13.2.1...13.3.0

Assets 5

0 Join discussion

07 Jul 09:02

julesjacobsen

13.2.1

31e3020

SV `<INS>` bugfix

This is a bugfix release to address the blanket scoring of <INS> variants with a variant score of 1.0. The fix should increase the accuracy of SV call prioritisation.

Fix for bug where all <INS> structural variants were given a maximal variant score of 1.0 regardless of their position on a transcript.
Added partial implementation of SVanna scoring for coding and splice site symbolic variants.
Fix for issue #481 where TSV and VCF results files would contain no data when the analysis inheritanceModes was empty.

IMPORTANT! This will be the last major release to run on Java 11. Subsequent major releases (i.e. 14+) will require Java 17.

Assets 5

0 Join discussion

28 Feb 18:54

julesjacobsen

13.2.0

82ddeac

Sometimes it's the little things...

This release adds a couple of minor quality of life features to the CLI and fixes a few bugs.

New multi-architecture docker images with and without bash #471. These images can be found on https://hub.docker.com/repositories/exomiser
Deprecated of output-prefix CLI option (will be removed in next major version) #469
Added output-directory and output-filename CLI options to replace output-prefix #469
Added output-format CLI option #471
Fixed excessive CPU usage and application hang after variant prioritisation with large number of results #479
Fixed issue #478 where gene.tsv output files are empty when running a phenotype only prioritisation.
Fixed broken links to OMIM in the phenotypic similarity section of the HTML output #465
Added gene symbol as HTML id tag in gene panel HTML results #422

Assets 4

0 Join discussion

29 Jul 13:45

julesjacobsen

13.1.0

727c90f

Automated ACMG, p-values, simpler output, documentation!

The three new features for this release is the automated ACMG classification of small sequence variants, calculating
p-values for the combined scores and providing new and more interpretable TSV and VCF output files.

Added new automated ACMG annotations for top-scoring variants in known disease-causing genes.
Added new combined score p-value
Added new TSV_GENE, TSV_VARIANT and VCF output files containing ranked genes/variants for all the assessed modes of
inheritance. Note that these new file formats will supersede the existing individual MOI-specific TSV/VCF files which
will be removed in the next major release. See the online documentation for details.
New update online documentation! See https://exomiser.readthedocs.io/en/latest/
New Docker hub images for CLI and web on https://hub.docker.com/u/exomiser
Added checks to ensure user specifies genome assembly if user specifies VCF path outside of phenopacket/analysis
Added --output-prefix option to enable output prefix directly on the command line
Updated examples to use the latest recommended settings as per preset derived from 100,000 genomes project

for the latest data, please follow the discussions for announcements: #424

Assets 3

0 Join discussion

23 Nov 14:25

julesjacobsen

13.0.1

e2987ae

hg38 only configuration bugfix

Bug-fix release. No external changes.

CLI changes

Bug fix for issue #410 where application fails to start when only specifying hg38 data in application.properties

Assets 3

23 Sep 13:53

julesjacobsen

13.0.0

54a50c8

Phenopackets and Structural Variants

This release is primarily focussed on enabling simultaneous prioritisation of structural and non-structural variation
with as consistent an API as possible for both types of variation. It also introduces a new API for specifying richer
information about a Sample based on the v1 GA4GH Phenopacket

This release requires data version >= 2109 and Java version >= 11 (Java 17 recommended).

Until we're able to upload the data to the usual data.monarchinitiative.org/exomiser/latest you can download the data using these links:

2109_hg19
2109_hg38
2109_phenotype

CLI changes

Minimum Java version is now set to Java 11
New structural variant interpretation alongside small variants - requires data version 2109 or higher. This has
been tested using Manta and Canvas short-read callers and pbsv long-read caller.
New command line options for more flexible input: --sample --output, --vcf, --batch, --preset --assembly --ped . Run
--help for details
Phenopackets v1.0 can be used to input sample phenotype data
Added ability to specify proband age and sex in input options either via a phenopacket or the 'sample' format
Improved MOI disease - phenotype matching with added Orphanet MOIs
Improved incomplete penetrance calculation when using the ANY mode of inheritance option
Added a minExomiserGeneScore option for limiting the output genes to have a mimimum Exomiser combined score. This is
disabled by default. If enabling it, we recommend using a minimum score of 0.7
BREAKING CHANGE - JSON output changes pos renamed as start, chrmosomeName renamed as contigName.
Deleted chromosome field (use contigName). New fields: end, length, changeLength and variantType

Core API

API breaking changes:

New target Java version set to 11
Exomiser.run() now requires Sample and Analysis arguments
AnalysisRunner interface now requires Sample and Analysis arguments
Analysis fields vcfPath, pedigree, probandSampleName and genomeAssembly moved to new Sample class
PedigreeSampleValidator moved from util into new sample package
Replaced SampleIdentifierUtil with SampleIdentifiers class
Replaced SampleIdentifier with SampleData
Variant now extends org.monarchinitiative.svart.Variant - see https://github.com/exomiser/svart/ for details
Deprecated VariantCoordinates - replaced by org.monarchinitiative.svart.Variant
VariantEvaluation.getSampleGenotypes() now returns a SampleGenotypes class
Changed VariantAnnotation from implementing Variant to implementing new VariantAnnotations interface
Updated variant coordinates getChromosome(), chromosomeName(), getPosition(), getRef(), getAlt() to
use Svart contigId(), contigName(), start(), end(), ref() and alt() signatures
Replaced RsId with String type in FrequencyData constructors and return from hasDbSnpRsID() method
Replaced Contig class with new Contigs class
VariantAnnotator interface changed to List<VariantAnnotation> annotate(@Nullable Variant variant)
VariantContextSampleGenotypeConverter.createAlleleSampleGenotypes() method now returns a SampleGenotypes object
VariantFactory now a @FunctionalInterface with a createVariantEvaluations()
VariantFactoryImpl now requires VariantAnnotator and VcfReader input arguments
VcfCodecs now requires List rather than Set inputs

New APIs:

New protobuf schemas for Job, Analysis, Sample, OutputOptions
New Exomiser.run(JobProto.Job job) entry point
New FluentAnalysisBuilder interface implemented by AnalysisBuilder and new AnalysisProtoBuilder for consistent
API between proto and domain classes
New AnalysisGroup class extracted from AbstractAnalysisRunner
New Sample class to encapsulate data about the sample, such as Age and Sex
New Age class
New Phenopacket... classes for reading and converting sample data from v1 phenopackets
New Proto converter classes
New SampleIdentifiers class
New SampleData class to contain sampleIdentifier, SampleGenotype and CopyNumber
New SampleGenotypes class to handle
New CopyNumber class for handling copy number variation data from VCF
New AbstractVariant class
New VariantAnnotations interface
New AlleleCall.parseAlleleCall() method
New Pedigree justProband(String id, Individual.Sex sex)) and anscestorsOf(Pedigree.Individual individual)
methods
New SvFrequencyDao, SvPathogenicityDao and SvDaoUtil classes
New VariantWhiteListLoader class
New JannovarAnnotationService.annotateSvGenomeVariant() method
New JannovarSmallVariantAnnotator class
New JannovarStructuralVariantAnnotator class
New TranscriptModelUtil class
New VcfReader interface with VcfFileReader and NoOpVcfReader implementations
New VariantContextConverter class for converting VariantContext objects into Variant

Other changes:

Updated Spring Boot to version 2.5.3
Updated Jannovar to version 0.30
Updated HTSJDK to version 2.24.1
AnalysisResults now hold references to original Sample and Analysis objects
GenomeAnalysisService can now return a VariantAnnotator object
GenomeAssembly now wraps two GenomicAssembly objects
Added ClinVarData starRating() and isSecondaryAssociationRiskFactorOrOther() methods
Added DBVAR, DECIPHER, DGV, GNOMAD_SV and GONL SV FrequencySource
Updated VariantEffectPathogenicityScore to become final and added default inversion score
Numerous small changes to improve performance.

Assets 3

01 Oct 15:21

julesjacobsen

12.1.0

06c90d9

Discovering the ID

This point release is compatible with the 1902, 2003 and 2007 data releases. We recommend you check for the latest data update at https://data.monarchinitiative.org/exomiser/latest/ to keep Exomiser functioning optimally with the latest data.

New features:

The JSON output now shows the id of the variantEvaluation taken from the VCF file.

New APIs:

Added VariantEvaluation.getId() and VariantEvaluation.Builder.id() methods to store VCF id field contents.

Assets 4

22 May 11:05

julesjacobsen

12.0.1

11b8f68

Unifying the disease types

This is a point release to address this issue:

#337 (comment)

There are no other changes.

Assets 3

13 Mar 15:50

julesjacobsen

12.0.0

eeefb36

Up to eleven and one more - new pathogenicity scores and a variant whitelist

CLI changes

This release contains significant diagnostic performance improvements due to the inclusion of a high-quality ClinVar whitelist and 'second generation' pathogenicity scores.

Added new PathogenicitySource sources - M_CAP, MPC, MVP, PRIMATE_AI. Be aware that these may not be free for commercial use. Check the licencing before use!
Added new variant whitelist feature which enables flagging of variants on a whitelist and bypassing of FrequencyFilter and VariantEffectFilter. By default this will use ClinVar variants listed as Pathogenic or Likely_pathogenic and with a review status of criteria provided, single submitter or better. See https://www.ncbi.nlm.nih.gov/clinvar/docs/review_status/ for an explanation of the ClinVar review status.

n.b. This release is incompatible with data release 1811 and below.

Core API

API breaking changes:

Removed FREQUENCY_SOURCE_MAP from FrequencySource
Changed Frequency, RsId and PathogenicityScore static valueOf() constructor to of()
Removed deprecated IntervalFilter.getGeneticInterval()
Changed visibility of PhenodigmMatchRawScore from public to package private and made immutable
Changed visibility of CrossSpeciesPhenotypeMatcher from public to package private and added static of() constructor
Replaced redundant Default*DaoMvStoreProto classes with new AllelePropertiesDaoMvStore
Added OntologyService as constructor argument to AnalysisFactory, AnalysisParser and AnalysisBuilder
Replaced BasePathogenicityScore.compareTo() method with default PathogenicityScore.compareTo()
GeneticInterval no longer accepts ReferenceDictionary as a constructor argument

New APIs:

Added CADD and REMM to data-genome AlleleProperty
Moved JannovarDataSourceLoader from autoconfigure to core module
Added AllelePosition.isSymbolic() method
Added Variant.isCodingVariant() method
Added AnalysisBuilder.addIntervalFilter(Collection<ChromosomalRegion> chromosomalRegions) method
Added new non-public FilterStats class for more accurate filtering statistics
Added new AllelePropertiesDao interface
Added new AllelePropertiesDaoMvStore implementation
Added new AllelePropertiesDaoAdapter to fix issue of Spring cache proxy not being able to intercept internal calls
Added new HpoIdChecker class to return current HPO id/terms for an input id/term
Added new HumanPhenotypeOntologyDao.getIdToPhenotypeTerms() method
Added new OntologyService.getCurrentHpoIds() method
Added new SampleGenotype.isEmpty() method
Added new experimental VcfCodecs class for de/serialising VCF lines
Added new JannovarDataProtoSerialiser.loadProto() method for loading intermediate JannovarProto.JannovarData
Added new VariantWhiteList and InMemoryVariantWhiteList implementation
Added new VariantEvaluation.isWhiteListed() method and relevant builder methods
Added new JannovarDataFactory for a simple programmatic API to build JannovarData objects
Added new TranscriptSource enum
Added new PathogenicityScore.of() static factory constructor
Added new PathogenicityScore.getRawScore() method
Added default PathogenicityScore.compareTo() method
Added new static PathogenicityScore.compare() method
Added new ScaledPathogenicityScore class
Added new MpcScore class
Add new Contig class for converting contig names to integer-based id

Other changes:

Updated Spring Boot to version 2.1.3
Updated Jannovar to version 0.28
Updated HTSJDK to version 2.18.2
Refactored FrequencyData to use array-based backing for 5-10% memory usage improvement and lower GC especially when nearing max memory
Refactored AnalysisParser to utilise AnalysisBuilder directly reducing code duplication
Refactored AnalysisRunner classes to to utilise new FilterStats class
Refactored QueryPhenotypeMatch to store and return input queryPhenotypeMatches argument
Refactored VariantDataServiceImpl to use new AllelePropertiesDao
Refactored VariantDataServiceImpl for better readability and performance
Added check for obsolete HPO id input in AnalysisBuilder.hpoIds()
Re-enabled PhenixPrioritiser in AnalysisParser
Refactored VariantEvaluation.getSampleGenotypeString() implementation to use SampleGenotype instead of VariantContext
Refactored VariantEffectCounter internals with VariantEvaluation calls in place of VariantContext
Enabled flagging of variants on a whitelist and bypassing of FrequencyFilter and VariantEffectFilter
Changed DefaultDiseaseDao to only return diseases marked as having known disease-gene association or copy-number/structural causes
Added range check to BasePathogenicityScore constructor
Updated CaddScore and SiftScore to extend ScaledPathogenicityScore
Updated CaddDao to use CADD phred scaled score directly
Replaced production use of ReferenceDictionary from HG19RefDictBuilder with Contig
Added new PathogenicitySource sources - M_CAP, MPC, MVP, PRIMATE_AI. Be aware that these may not be free for commercial use.

Assets 3

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CLI changes

CLI changes

Core API

CLI changes

Core API

Releases: exomiser/Exomiser

New Java version, new database format, smaller data downloads, more ACMG categories, better reporting...

MT codon tables and Bayesian ACMG

SV `<INS>` bugfix

Sometimes it's the little things...

Automated ACMG, p-values, simpler output, documentation!

hg38 only configuration bugfix

CLI changes

Phenopackets and Structural Variants

CLI changes

Core API

Discovering the ID

Unifying the disease types

Up to eleven and one more - new pathogenicity scores and a variant whitelist

CLI changes

Core API