Development¶

Please use the PyVCF repository. Pull requests gladly accepted. Issues should be reported at the github issue tracker.

Running tests¶

Please check the tests by running them with:

python setup.py test

New features should have test code sent with them.

Changes¶

0.6.7 Release¶

Include missing .pyx files

0.6.6 Release¶

better walk together record ordering (Thanks @datagram, #141)

0.6.5 Release¶

Better contig handling (#115, #116, #119 thanks Martijn)
INFO lines with type character (#120, #121 thanks @AndrewUzilov, Martijn)
Single breakends fix (#126 thanks @pkrushe)
Speedup by losing ordering of INFO (#128 thanks Martijn)
HOMSEQ and other missing fields in INFO (#130 thanks Martijn)
Add aaf property, (thanks @mgymrek #131)
Custom equality for walk_together, thanks bow #132
Change default line encoding to ‘n’
Improved __eq__ (#134, thanks bow)

0.6.4 Release¶

Handle INFO fields with multiple values, thanks
Support writing records without GT data #88, thanks @bow
Pickleable call data #112, thanks @superbobry
Write files without FORMAT #95 thanks Martijn
Strict whitespace mode, thanks Martijn, Lee Lichtenstein and Manawsi Gupta
Add support for contigs in header, thanks @gcnh and Martijn
Fix GATK header parsing, thanks @alimanfoo

0.6.3 Release¶

cython port of #79
correct writing of meta lines #84

0.6.2 Release¶

issues #78, #79 (thanks Sean, Brad)

0.6.1 Release¶

Add strict whitespace mode for well formed VCFs with spaces in sample names (thanks Marco)
Ignore blank lines in files (thanks Martijn)
Tweaks for handling missing data (thanks Sean)
bcftools tests (thanks Martijn)
record.FILTER is always a list

0.6.0 Release¶

Backwards incompatible change: _Call.data is now a namedtuple (previously it was a dict)
Optional cython version, much improved performance.
Improvements to writer (thanks @cmclean)
Improvements to inheritance of classes (thanks @lennax)

0.5.0 Release¶

VCF 4.1 support: - support missing genotype #28 (thanks @martijnvermaat) - parseALT for svs #42, #48 (thanks @dzerbino)
trim_common_suffix method #22 (thanks @martijnvermaat)
Multiple metadata with the same key is stored (#52)
Writer improvements: - A/G in Number INFO fields #53 (thanks @lennax) - Better output #55 (thanks @cmclean)
Allow malformed INFO fields #49 (thanks @ilyaminkin)
Added bayes factor error bias VCF filter
Added docs on vcf_melt
filters from @libor-m (SNP only, depth per sample, avg depth per sample)
change to the filter API, use docstring for filter description

0.4.6 Release¶

Performance improvements (#47)
Preserve order of INFO column (#46)

0.4.5 Release¶

Support exponent syntax qual values (#43, #44) (thanks @martijnvermaat)
Preserve order of header lines (#45)

0.4.4 Release¶

Support whitespace in sample names
SV work (thanks @arq5x)
Python 3 support via 2to3 (thanks @marcelm)
Improved filtering script, capable of importing local files

0.4.3 Release¶

Single floats in Reader._sample_parser not being converted to float #35
Handle String INFO values when Number=1 in header #34

0.4.2 Release¶

Installation problems

0.4.1 Release¶

Installation problems

0.4.0 Release¶

Package structure
add vcf.utils module with walk_together method
samtools tests
support Freebayes’ non standard ‘.’ for no call
fix vcf_melt
support monomorphic sites, add is_monomorphic method, handle null QUALs
filter support for files with monomorphic calls
Values declared as single are no-longer returned in lists
several performance improvements

0.3.0 Release¶

Fix setup.py for python < 2.7
Add __eq__ to _Record and _Call
Add is_het and is_variant to _Call
Drop aggressive parse mode: we’re always aggressive.
Add tabix fetch for single calls, fix one->zero based indexing
add prepend_chr mode for Reader to add chr to CHROM attributes

0.2.2 Release¶

Documentation release

0.2.1 Release¶

Add shebang to vcf_filter.py

0.2 Release¶

Replace genotype dictionary with a Call object
Methods on Record and Call (thanks @arq5x)
Shortcut parse_sample when genotype is None

0.1 Release¶

Added test code
Added Writer class
Allow negative number in INFO and FORMAT fields (thanks @martijnvermaat)
Prefer vcf.Reader to vcf.VCFReader
Support compressed files with guessing where filename is available on fsock
Allow opening by filename as well as filesocket
Support fetching rows for tabixed indexed files
Performance improvements (see test/prof.py)
Added extensible filter script (see FILTERS.md), vcf_filter.py

Contributions¶

Project started by @jdoughertyii and taken over by @jamescasbon on 12th January 2011. Contributions from @arq5x, @brentp, @martijnvermaat, @ian1roberts, @marcelm.

This project was supported by Population Genetics.