Skip to content

Commit bbc4f0b

Browse files
committed
Arriba 2.2.0
1 parent 2de486c commit bbc4f0b

File tree

6 files changed

+19
-19
lines changed

6 files changed

+19
-19
lines changed

Dockerfile

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -12,7 +12,7 @@ RUN wget -qO - 'https://github.com/alexdobin/STAR/archive/2.7.10a.tar.gz' | \
1212
tar --strip-components=3 -C /usr/local/bin -xzf - 'STAR-2.7.10a/bin/Linux_x86_64/STAR'
1313

1414
# install arriba
15-
RUN wget -qO - 'https://github.com/suhrig/arriba/releases/download/v2.1.0/arriba_v2.1.0.tar.gz' | tar -xzf - --exclude='arriba*/.git'
15+
RUN wget -qO - 'https://github.com/suhrig/arriba/releases/download/v2.2.0/arriba_v2.2.0.tar.gz' | tar -xzf - --exclude='arriba*/.git'
1616

1717
# make wrapper script for download_references.sh
1818
RUN echo '#!/bin/bash\n\

LICENSE

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ License of arriba (code & executable), documentation, and database files:
44

55
The MIT/Expat License
66

7-
Copyright (C) 2016-2021 Sebastian Uhrig ([email protected])
7+
Copyright (C) 2016-2022 Sebastian Uhrig ([email protected])
88

99
Permission is hereby granted, free of charge, to any person obtaining a copy
1010
of this software and associated documentation files (the "Software"), to deal

documentation/quickstart.md

Lines changed: 10 additions & 10 deletions
Original file line numberDiff line numberDiff line change
@@ -6,9 +6,9 @@ Arriba has only a single prerequisite: [STAR](https://github.com/alexdobin/STAR)
66
Compile the latest stable version of Arriba or use the precompiled binaries in the download file. **Note: You should not use `git clone` to download Arriba, because the git repository does not include the blacklist! Instead, download the latest tarball from the [releases page](https://github.com/suhrig/arriba/releases/) as shown here:**
77

88
```bash
9-
wget https://github.com/suhrig/arriba/releases/download/v2.1.0/arriba_v2.1.0.tar.gz
10-
tar -xzf arriba_v2.1.0.tar.gz
11-
cd arriba_v2.1.0 && make # or use precompiled binaries
9+
wget https://github.com/suhrig/arriba/releases/download/v2.2.0/arriba_v2.2.0.tar.gz
10+
tar -xzf arriba_v2.2.0.tar.gz
11+
cd arriba_v2.2.0 && make # or use precompiled binaries
1212
```
1313

1414
Arriba requires an assembly in FastA format, gene annotation in GTF format, and a STAR index built from the two. You can use your preferred assembly and annotation, as long as their coordinates are compatible with hg19/hs37d5/GRCh37 or hg38/GRCh38 or mm10/GRCm38 or mm39/GRCm39. If you use another assembly, then the coordinates in the blacklist will not match and the predictions will contain many false positives. GENCODE annotation is recommended over RefSeq due to more comprehensive annotation of immunoglobulin/T-cell receptor loci and splice sites, which improves sensitivity. If you do not already have the files and a STAR index, you can use the script `download_references.sh`. It downloads the files to the current working directory and builds a STAR index. Run the script without arguments to see a list of available files. Choose a file with the keyword `viral` if Arriba is supposed to detect viral integration sites. Note that this step requires ~45 GB of RAM and 8 cores (can be adjusted by setting the environment variable `THREADS`).
@@ -22,7 +22,7 @@ The download file contains a script `run_arriba.sh`, which demonstrates the usag
2222
Run the demo script with 8 threads. In case of single-end data, the second FastQ file is omitted.
2323

2424
```bash
25-
./run_arriba.sh STAR_index_hs37d5viral_GENCODE19/ GENCODE19.gtf hs37d5viral.fa database/blacklist_hg19_hs37d5_GRCh37_v2.1.0.tsv.gz database/known_fusions_hg19_hs37d5_GRCh37_v2.1.0.tsv.gz database/protein_domains_hg19_hs37d5_GRCh37_v2.1.0.gff3 8 test/read1.fastq.gz test/read2.fastq.gz
25+
./run_arriba.sh STAR_index_hs37d5viral_GENCODE19/ GENCODE19.gtf hs37d5viral.fa database/blacklist_hg19_hs37d5_GRCh37_v2.2.0.tsv.gz database/known_fusions_hg19_hs37d5_GRCh37_v2.2.0.tsv.gz database/protein_domains_hg19_hs37d5_GRCh37_v2.2.0.gff3 8 test/read1.fastq.gz test/read2.fastq.gz
2626
```
2727

2828
Installation using Docker
@@ -33,7 +33,7 @@ Install [Docker](https://www.docker.com/) according to the developers' instructi
3333
Run the script `download_references.sh` inside the Docker container. It downloads the assembly and gene annotation to the directory `/path/to/references` and builds a STAR index. Run the script without arguments to see a list of available files. Choose a file with the keyword `viral` if Arriba is supposed to detect viral integration sites. Note that this step requires ~45 GB of RAM and 8 cores (can be adjusted by passing the parameter `--env=THREADS=...`).
3434

3535
```bash
36-
docker run --rm -v /path/to/references:/references uhrigs/arriba:2.1.0 download_references.sh hs37d5viral+GENCODE19
36+
docker run --rm -v /path/to/references:/references uhrigs/arriba:2.2.0 download_references.sh hs37d5viral+GENCODE19
3737
```
3838

3939
Use the following Docker command to run Arriba from the container. Replace `/path/to/` with the path to the respective input file. Leave the paths after the colons unmodified - these are the paths inside the Docker container. In case of single-end data, the second FastQ file is omitted. Running Arriba requires ~45 GB of RAM and 8 cores (can be adjusted by passing the parameter `--env=THREADS=...`).
@@ -44,7 +44,7 @@ docker run --rm \
4444
-v /path/to/references:/references:ro \
4545
-v /path/to/read1.fastq.gz:/read1.fastq.gz:ro \
4646
-v /path/to/read2.fastq.gz:/read2.fastq.gz:ro \
47-
uhrigs/arriba:2.1.0 \
47+
uhrigs/arriba:2.2.0 \
4848
arriba.sh
4949
```
5050

@@ -57,7 +57,7 @@ The Docker container is compatible with Singularity. If desired, it can be conve
5757

5858
```bash
5959
mkdir /path/to/references
60-
singularity exec -B /path/to/references:/references docker://uhrigs/arriba:2.1.0 download_references.sh hs37d5viral+GENCODE19
60+
singularity exec -B /path/to/references:/references docker://uhrigs/arriba:2.2.0 download_references.sh hs37d5viral+GENCODE19
6161
```
6262

6363
Use the following Singularity command to run Arriba from the container. Replace `/path/to/` with the path to the respective input file. Leave the paths after the colons unmodified - these are the paths inside the Singularity container. In case of single-end data, the second FastQ file is omitted. Running Arriba requires ~45 GB of RAM and 8 cores (can be adjusted by setting the environment variable `SINGULARITYENV_THREADS`).
@@ -68,7 +68,7 @@ singularity exec \
6868
-B /path/to/references:/references:ro \
6969
-B /path/to/read1.fastq.gz:/read1.fastq.gz:ro \
7070
-B /path/to/read2.fastq.gz:/read2.fastq.gz:ro \
71-
docker://uhrigs/arriba:2.1.0 \
71+
docker://uhrigs/arriba:2.2.0 \
7272
arriba.sh
7373
```
7474

@@ -80,7 +80,7 @@ Install [Miniconda](https://conda.io/) according to the developers' instructions
8080
Install the `arriba` package:
8181

8282
```bash
83-
conda install -c conda-forge -c bioconda arriba=2.1.0
83+
conda install -c conda-forge -c bioconda arriba=2.2.0
8484
```
8585

8686
Run the script `download_references.sh`, which is installed inside the conda environment. It downloads the assembly and gene annotation to the current working directory and builds a STAR index. Run the script without arguments to see a list of available files. Choose a file with the keyword `viral` if Arriba is supposed to detect viral integration sites. Note that this step requires ~45 GB of RAM and 8 cores (can be adjusted by setting the environment variable `THREADS`). Replace `$CONDA_PREFIX` with the path to your conda environment.
@@ -93,7 +93,7 @@ To process FastQ files, run the script `run_arriba.sh`, which is installed insid
9393

9494
```bash
9595
ARRIBA_FILES=$CONDA_PREFIX/var/lib/arriba
96-
run_arriba.sh STAR_index_hs37d5viral_GENCODE19/ GENCODE19.gtf hs37d5viral.fa $ARRIBA_FILES/blacklist_hg19_hs37d5_GRCh37_v2.1.0.tsv.gz $ARRIBA_FILES/known_fusions_hg19_hs37d5_GRCh37_v2.1.0.tsv.gz $ARRIBA_FILES/protein_domains_hg19_hs37d5_GRCh37_v2.1.0.gff3 8 $ARRIBA_FILES/read1.fastq.gz $ARRIBA_FILES/read2.fastq.gz
96+
run_arriba.sh STAR_index_hs37d5viral_GENCODE19/ GENCODE19.gtf hs37d5viral.fa $ARRIBA_FILES/blacklist_hg19_hs37d5_GRCh37_v2.2.0.tsv.gz $ARRIBA_FILES/known_fusions_hg19_hs37d5_GRCh37_v2.2.0.tsv.gz $ARRIBA_FILES/protein_domains_hg19_hs37d5_GRCh37_v2.2.0.gff3 8 $ARRIBA_FILES/read1.fastq.gz $ARRIBA_FILES/read2.fastq.gz
9797
```
9898

9999
Output files

documentation/visualization.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -52,8 +52,8 @@ The following command demonstrates the usage. Please refer to section [Command-l
5252
--alignments=Aligned.sortedByCoord.out.bam \
5353
--output=fusions.pdf \
5454
--annotation=GENCODE19.gtf \
55-
--cytobands=database/cytobands_hg19_hs37d5_GRCh37_v2.1.0.tsv \
56-
--proteinDomains=database/protein_domains_hg19_hs37d5_GRCh37_v2.1.0.gff3
55+
--cytobands=database/cytobands_hg19_hs37d5_GRCh37_v2.2.0.tsv \
56+
--proteinDomains=database/protein_domains_hg19_hs37d5_GRCh37_v2.2.0.gff3
5757
```
5858

5959
**Execution via Docker**
@@ -67,7 +67,7 @@ docker run --rm \
6767
-v /path/to/fusions.tsv:/fusions.tsv:ro \
6868
-v /path/to/Aligned.sortedByCoord.out.bam:/Aligned.sortedByCoord.out.bam:ro \
6969
-v /path/to/Aligned.sortedByCoord.out.bam.bai:/Aligned.sortedByCoord.out.bam.bai:ro \
70-
uhrigs/arriba:2.1.0 \
70+
uhrigs/arriba:2.2.0 \
7171
draw_fusions.sh
7272
```
7373

@@ -82,7 +82,7 @@ singularity exec \
8282
-B /path/to/fusions.tsv:/fusions.tsv:ro \
8383
-B /path/to/Aligned.sortedByCoord.out.bam:/Aligned.sortedByCoord.out.bam:ro \
8484
-B /path/to/Aligned.sortedByCoord.out.bam.bai:/Aligned.sortedByCoord.out.bam.bai:ro \
85-
docker://uhrigs/arriba:2.1.0 \
85+
docker://uhrigs/arriba:2.2.0 \
8686
draw_fusions.sh
8787
```
8888

download_references.sh

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -89,9 +89,9 @@ fi > "$ASSEMBLY$VIRAL.fa"
8989

9090
if [ "$VIRAL" = "viral" ]; then
9191
echo "Appending RefSeq viral genomes"
92-
REFSEQ_VIRAL_GENOMES=$(dirname "$0")/RefSeq_viral_genomes_v2.1.0.fa.gz
92+
REFSEQ_VIRAL_GENOMES=$(dirname "$0")/RefSeq_viral_genomes_v2.2.0.fa.gz
9393
if [ ! -e "$REFSEQ_VIRAL_GENOMES" ]; then
94-
REFSEQ_VIRAL_GENOMES=$(dirname "$0")/database/RefSeq_viral_genomes_v2.1.0.fa.gz
94+
REFSEQ_VIRAL_GENOMES=$(dirname "$0")/database/RefSeq_viral_genomes_v2.2.0.fa.gz
9595
fi
9696
gunzip -c "$REFSEQ_VIRAL_GENOMES" >> "$ASSEMBLY$VIRAL.fa"
9797
fi

source/options.hpp

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -12,7 +12,7 @@ const string HELP_CONTACT = "https://github.com/suhrig/arriba/issues";
1212
const string USER_MANUAL = "https://arriba.readthedocs.io/";
1313
const string CODE_REPOSITORY = "https://github.com/suhrig/arriba";
1414
const string CITATION = "https://doi.org/10.1101/gr.257246.119";
15-
const string ARRIBA_VERSION = "2.1.0";
15+
const string ARRIBA_VERSION = "2.2.0";
1616

1717
string wrap_help(const string& option, const string& text, const unsigned short int max_line_width = 80);
1818

0 commit comments

Comments
 (0)