This release adds comprehensive support for bioinformatics and computational biology workflows, with enhancements specifically designed for NGS analysis, HPC environments, and large-scale data processing.
External Tool Version Tracking
track_external_tools(): Track versions of command-line tools (samtools, STAR, BWA, etc.)get_tool_versions(): Retrieve tracked tool versionsConda/Mamba Environment Support
track_conda_env(): Export and track conda environmentsrestore_conda_env(): Restore conda environments from YAMLget_conda_env_info(): Retrieve conda environment informationReference Genome Tracking
track_reference_genome(): Track reference genomes, annotations, and indicesget_reference_info(): Retrieve reference genome informationlist_reference_sources(): Display common reference genome sourcesLarge File Handling
track_data() with smart checksumming for large files (>1GB)System Library Detection
capture_system_libraries(): Detect system library versionsHardware Information Capture
capture_hardware(): Capture CPU, RAM, and GPU specificationsgenerate_singularity(): Generate Singularity definition filesexport_for_nextflow(): Export data for Nextflow pipelinesexport_for_snakemake(): Export data for Snakemake workflowsexport_for_wdl(): Export data for WDL workflowsexport_for_cwl(): Export data for CWL workflowscompare_snapshots(): Compare two workflow snapshotslist_snapshots(): List all available snapshots with metadataSession Tracking: Comprehensive R session information capture
capture_session(): Capture R version, platform, and system infocapture_environment(): Capture global environment statePackage Management: Complete package version tracking
snapshot_packages(): Create detailed package manifestscreate_renv_lockfile(): Generate renv lockfilesData Provenance: Track data files with integrity verification
track_data(): Record data source, checksums, and metadataverify_data(): Verify data integrity via SHA-256 checksumsget_data_lineage(): Retrieve complete data provenanceParameter Tracking: Document analysis parameters
track_params(): Store analysis parameters with metadataget_param_history(): Retrieve parameter historyRandom Seed Management: Reproducible random number generation
set_seed(): Set and track random seedsrestore_seed(): Restore previously tracked seedsScript Generation: Create reproducible analysis scripts
generate_repro_script(): Generate executable R scriptscreate_repro_report(): Generate markdown reportsDocker Support: Containerization for perfect reproducibility
generate_docker(): Generate Dockerfile and docker-compose.ymlWorkflow Management: Complete workflow orchestration
init_capsule(): Initialize Capsule in projectssnapshot_workflow(): Create complete workflow snapshots