Methylation analysis: P. generosa juvenile OA response

bismark currently running on full data set

mapping and coverage analysis with subsetted samples (10K reads each)

bismark summary
- all samples have > 50% mapping efficiency with Bismark settings –score_min L,0,-1.2 -I 60 –non_directional
- deduplication removed very few reads, per usual
- % methylation is on average 20%

methylkit summary - coverage plots don’t appear to obviously show any particular sample or condition should be excluded due to low coverage.this may become more obvious in the full analysis or downstream analysis
- the genome region > 60000 seems to have low coverage in all samples
plot of genome regions with no coverage organized by length of treatment

C. virginica

bismark run completed on mox

- last part failed because I copied from jupyter notebook and had '!' in the code still
- need to edit this part for future scripts, but for now i will complete the analysis on ostrich in my jupyter notebook
	- generate summary file
	- generate sorted bam files for methylkit analysis
- need to rsync everything onto Metacarcinus from Mox gscratch

failed code

	/var/spool/slurm/d/job394229/slurm_script: line 47: !cat: command not found
	/var/spool/slurm/d/job394229/slurm_script: line 51: !sed: command not found
	/var/spool/slurm/d/job394229/slurm_script: line 56: fg: no job control
	xargs: samtools: No such file or directory
	/var/spool/slurm/d/job394229/slurm_script: line 61: -o: command not found

code excerpt

	46 #create summary report
	47 !cat /gscratch/srlab/strigg/analyses/20181004/*PE_report.txt | \
	48 grep 'Mapping\ efficiency\:' | \
	49 cat - /gscratch/srlab/strigg/analyses/20181004/*.deduplication_report.txt > /gscratch/srlab/strigg/analyses/20181004/zr2096_mapping_dedup_summary.txt
	50 #clean up summary report
	51 !sed 's/Mapping\ efficiency\://g' /gscratch/srlab/strigg/analyses/20181004/zr2096_mapping_dedup_summary.txt | \
	52 sed 's/Total\ number\ duplicated\ alignments\ removed\://g' | \
	53 sed 's/ //g' | awk '{print $1}' > /gscratch/srlab/strigg/analyses/20181004/zr2096_mapping_dedup_summary_clean.txt
	54 
	55 #sort bams
	56 %%bash
	57 find /gscratch/srlab/strigg/analyses/20181004/*deduplicated.bam| \
	58 xargs basename -s _s1_R1_bismark_bt2_pe.deduplicated.bam | \
	59 xargs -I{} samtools \
	60 sort /gscratch/srlab/strigg/analyses/20181004/{}_s1_R1_bismark_bt2_pe.deduplicated.bam \ 
	61 -o /gscratch/srlab/strigg/analyses/20181004/{}_dedup.sorted.bam