slackbuilds/academic/pyCRAC
README             academic/pyCRAC: Updated for version 1.5.0.                        2021-02-27 08:24:29 +07:00
README.tests       academic/pyCRAC: Updated for version 1.5.0.                        2021-02-27 08:24:29 +07:00
pyCRAC.SlackBuild  All: Support $PRINT_PACKAGE_NAME env var                           2021-07-17 21:55:09 +02:00
pyCRAC.info        academic/pyCRAC: Fixed dep info                                    2021-04-18 23:22:23 +02:00
setup_slack.py     academic/pyCRAC: Updated for version 1.5.0.                        2021-02-27 08:24:29 +07:00
slack-desc         academic/pyCRAC: Added (Next generation sequencing analysis).      2017-11-03 23:18:37 +00:00
test_slack.sh      academic/pyCRAC: Updated for version 1.4.4.                        2019-04-13 06:29:59 +07:00

README

The pyCRAC package is a collection of Python scripts to analyse
high-throughput data generated by RNA sequencing, especially of
molecules crosslinked by UV to an immunoprecipitated protein of interest
(i.e. data generated by CLIP or CRAC protocols).
It can remove duplicate reads, handle directional libraries, and report
sense and anti-sense hits.

Included is the pipeline used for the analysis of a group of CRAC data
sets.


References

Genome Biol. 2014 Jan 7;15(1):R8. doi: 10.1186/gb-2014-15-1-r8.
PAR-CLIP data indicate that Nrd1-Nab3-dependent transcription
termination regulates expression of hundreds of protein coding genes in
yeast. Webb S, Hector RD, Kudla G, Granneman S.

Nature Communications, 2017; DOI: 10.1038/s41467-017-00025-5
Kinetic CRAC uncovers a role for Nab3 in determining gene expression
profiles during stress. van Nues R, Schweikert G, de Leau E, Selega
A, Langford A, Franklin R, Iosub I, Wadsworth P, Sanguinetti G,
Granneman S.

If you want to run the test suite after installation, see README.tests.


Notes on the CRAC pipelines:

Use the -h flag to get a detailed help menu.

The CRAC_pipeline_PE.py script needs to be run from the folder that
contains the fastq files.

The barcode list file should contain two tab-separated columns, in which
the first column is the barcode sequence and the second column is the
name of the experiment.
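
As an illustration of the barcode list layout described above, here is a
minimal Python sketch that reads such a file and rejects malformed
lines. It is not part of pyCRAC; the function name read_barcode_list and
the example barcodes and experiment names are hypothetical.

```python
import csv
import io


def read_barcode_list(handle):
    """Return a {barcode: experiment_name} dict from a two-column TSV."""
    barcodes = {}
    for row_number, row in enumerate(csv.reader(handle, delimiter="\t"), start=1):
        if not row or all(not cell.strip() for cell in row):
            continue  # skip blank lines
        if len(row) != 2:
            raise ValueError(
                f"line {row_number}: expected 2 tab-separated columns, got {len(row)}"
            )
        barcode, experiment = (cell.strip() for cell in row)
        if barcode in barcodes:
            raise ValueError(f"line {row_number}: duplicate barcode {barcode!r}")
        barcodes[barcode] = experiment
    return barcodes


# Hypothetical example content; these barcodes and names are made up.
example = "NNNTAAGC\tsample_A\nNNNATTAG\tsample_B\n"
print(read_barcode_list(io.StringIO(example)))
```

Checking the file this way before starting a long run catches the most
common mistake, columns separated by spaces instead of tabs.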

The file containing the adapter sequences should be in the fasta format.
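
For reference, a fasta file consists of header lines starting with ">"
followed by one or more sequence lines. The sketch below, which is not
pyCRAC code, reads such a file into a dictionary; the function name
read_fasta and the adapter names and sequences in the example are
hypothetical.

```python
import io


def read_fasta(handle):
    """Return a {record_name: sequence} dict from FASTA-formatted text."""
    records = {}
    name = None
    for line in handle:
        line = line.strip()
        if not line:
            continue  # ignore blank lines
        if line.startswith(">"):
            name = line[1:].strip()
            if not name:
                raise ValueError("FASTA header without a name")
            records[name] = ""
        else:
            if name is None:
                raise ValueError("sequence data found before the first header")
            records[name] += line  # sequences may span several lines
    return records


# Hypothetical adapter file content; names and sequences are made up.
example = ">adapter_one\nACGTACGT\nTTGA\n>adapter_two\nGGCCAATT\n"
print(read_fasta(io.StringIO(example)))
```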

The chromosome_lengths file should contain two tab-separated columns in
which the first column has the chromosome name and the second the
chromosome length.
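
The chromosome_lengths layout can be checked in the same spirit. The
following Python sketch is an assumption about how one might validate
the file, not how pyCRAC reads it; the function name
read_chromosome_lengths is hypothetical and the values in the example
are illustrative only.

```python
import io


def read_chromosome_lengths(handle):
    """Return a {chromosome_name: length} dict from a two-column TSV."""
    lengths = {}
    for line_number, line in enumerate(handle, start=1):
        if not line.strip():
            continue  # skip blank lines
        fields = line.rstrip("\r\n").split("\t")
        if len(fields) != 2:
            raise ValueError(
                f"line {line_number}: expected 2 tab-separated columns, got {len(fields)}"
            )
        name = fields[0].strip()
        length = int(fields[1])  # raises ValueError if not an integer
        if length <= 0:
            raise ValueError(f"line {line_number}: length must be positive")
        lengths[name] = length
    return lengths


# Illustrative content only; replace with the lengths for your genome.
example = "chrI\t230218\nchrII\t813184\n"
print(read_chromosome_lengths(io.StringIO(example)))
```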