Issue with group and dedup option #660

Open
LioLenr opened this issue Aug 20, 2024 · 0 comments

LioLenr commented Aug 20, 2024

Hello!

I am trying to detect viral transcripts in RNA-seq data (obtained with BD Rhapsody Enhanced Bead 3').
To do that, I am using UMI-tools to handle the UMIs and cell barcodes, and the STAR aligner for the alignment.
My plan is to:
1 - Create a whitelist from the raw reads using umi_tools whitelist, then use umi_tools extract to clean my reads.
2 - Build two indexes with STAR, one for the host and one for the virus, then align my processed reads against each index separately, again using STAR.
3 - Group and deduplicate the BAM files generated in step 2 with umi_tools group and umi_tools dedup.
4 - Generate two count matrices with umi_tools count, one of shape cell_id x host genes and one of shape cell_id x viral genes.
5 - Merge the two count matrices using the cell_id as the "anchor" point.
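
Step 5 can be sketched in plain Python. This is only a minimal illustration, assuming each count table is a tab-separated file in long format with the header columns gene, cell and count. The gene names, cell IDs, the `virus_` prefix and the helper names `read_long_counts` and `merge_on_cell` are all hypothetical. Cells missing from one table are filled with zeros, so the merge behaves like an outer join on cell_id.

```python
import csv
import io
from collections import defaultdict


def read_long_counts(handle):
    """Read a tab-separated table with header columns gene, cell, count."""
    counts = defaultdict(dict)  # cell -> {gene: count}
    for row in csv.DictReader(handle, delimiter="\t"):
        counts[row["cell"]][row["gene"]] = int(row["count"])
    return counts


def merge_on_cell(host, virus, prefix="virus_"):
    """Outer-join two cell -> {gene: count} tables on the cell ID."""
    host_genes = sorted({g for genes in host.values() for g in genes})
    virus_genes = sorted({g for genes in virus.values() for g in genes})
    # Prefix viral genes so they cannot collide with host gene names.
    header = ["cell_id"] + host_genes + [prefix + g for g in virus_genes]
    rows = []
    for cell in sorted(set(host) | set(virus)):
        row = [cell]
        row += [host.get(cell, {}).get(g, 0) for g in host_genes]
        row += [virus.get(cell, {}).get(g, 0) for g in virus_genes]
        rows.append(row)
    return header, rows


# Tiny made-up inputs standing in for the two count files.
host_tsv = "gene\tcell\tcount\nACTB\tcellA\t5\nGAPDH\tcellA\t2\nACTB\tcellB\t1\n"
virus_tsv = "gene\tcell\tcount\nNP\tcellB\t3\nNP\tcellC\t1\n"

host = read_long_counts(io.StringIO(host_tsv))
virus = read_long_counts(io.StringIO(virus_tsv))
header, rows = merge_on_cell(host, virus)
```

Keeping cells that appear in only one table (here cellA and cellC) makes it visible when a barcode has host counts but no viral counts, or the reverse.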

That is the plan.

However, I am struggling to group and dedup the BAM file generated by aligning the reads against the viral index; the BAM file from the host index works fine. I suspect that the very small size of the viral genome and index compared to the read size could be an issue.
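
To make the small-genome concern concrete, here is a simplified Python sketch of position-based UMI deduplication. It is not UMI-tools' actual algorithm: it only collapses reads whose cell barcode, contig, position and UMI match exactly, whereas UMI-tools by default uses its directional network method, which also merges near-identical UMIs. The read records and field names are made up for illustration. On a short reference many reads share the same start position, so the UMI carries almost all of the information used to tell molecules apart.

```python
from collections import defaultdict


def dedup_exact(reads):
    """Keep the first read seen for each (cell, contig, position, UMI) key."""
    seen = set()
    kept = []
    for read in reads:
        key = (read["cell"], read["contig"], read["pos"], read["umi"])
        if key not in seen:
            seen.add(key)
            kept.append(read)
    return kept


def umis_per_position(reads):
    """Count distinct UMIs observed at each (cell, contig, position)."""
    groups = defaultdict(set)
    for read in reads:
        groups[(read["cell"], read["contig"], read["pos"])].add(read["umi"])
    return {key: len(umis) for key, umis in groups.items()}


# Made-up reads that all pile up on one position of a short viral contig.
reads = [
    {"cell": "c1", "contig": "virus", "pos": 10, "umi": "AAAA"},
    {"cell": "c1", "contig": "virus", "pos": 10, "umi": "AAAA"},  # exact duplicate
    {"cell": "c1", "contig": "virus", "pos": 10, "umi": "AAAT"},
    {"cell": "c1", "contig": "virus", "pos": 10, "umi": "CCCC"},
    {"cell": "c2", "contig": "virus", "pos": 10, "umi": "AAAA"},
]

kept = dedup_exact(reads)
pileup = umis_per_position(reads)
```

Running a per-position tally like this on the real viral alignments would show how crowded each position is, which may help narrow down where group and dedup run into trouble.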

Can you see any other potential issues, and how could I resolve them?
Furthermore, if you have done something similar to what I am trying to do, could you suggest another tool, another approach, or a change to my approach?

Best regards,

Lionel Lenoir
