-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
There are too few interaction signals to manually adjust them in Juicebox #3
Comments
Sorry for my late reply, The heatmap plotting from Your genome’s high homozygosity may result in most of the mapping quality of pore-c fragments being smaller than 1. cphasing pairs2mnd porec.pairs.gz -o porec.mnd.txt -q 0 Best regards, |
Thank you for your suggestion. We will fix this limitation in the future or add a function to split long contigs. Best regards, |
Dear Dr. Yibin, Best regards, |
Hello,
First of all, thank you very much for developing such an excellent phasing and scaffolding tool. However, I have encountered some issues while using it.
I first assembled hap1.fa and hap2.fa using hifiasm, each approximately 1 GB in size. Then, I merged them into a single file named output.fa. Subsequently, I used the following commands:
cphasing pipeline -f output.fa -pcd porec.fq.gz -t 100 -n 18:2 -hcr
ln -sf cphasing_output/porec.pairs.gz ./
ln -sf cphasing_output/4.scaffolding/groups.agp ./
cphasing pairs2mnd porec.pairs.gz -o porec.mnd.txt
cphasing utils agp2assembly groups.agp > groups.assembly
docker run -i --rm -w ${PWD} -u $(id -u):$(id -g) -v /calculate:/calculate -v /data:/data hic:v2.1 /software/3d-dna-201008/visualize/run-assembly-visualizer.sh groups.assembly porec.mnd.txt
I then loaded the resulting groups.assembly and groups.hic files into Juicebox for manual adjustment, but I found that the interaction signals were too sparse to make adjustments effectively.
![image](https://private-user-images.githubusercontent.com/106056538/398681096-1aed6b40-0568-4a50-8268-364a19f4f5b2.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MzkzMjM0NzMsIm5iZiI6MTczOTMyMzE3MywicGF0aCI6Ii8xMDYwNTY1MzgvMzk4NjgxMDk2LTFhZWQ2YjQwLTA1NjgtNGE1MC04MjY4LTM2NGExOWY0ZjViMi5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjUwMjEyJTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI1MDIxMlQwMTE5MzNaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT02M2NhOTdhYWJkNWYxYzlmNDgxNWZlM2RmNzFkMGUyNjdmNWY2MmQzZDJjNTdiNDQ3OGNiOTcxOGQxOTk3NzAxJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCJ9.6eb65A5Bf9JOdlT0Xiz3eep5bsZLdqAblBFmZW4KjxM)
Here is a screenshot of the Juicebox view:
Additionally, the scaffolding plot generated directly by cphasing (without manual adjustment) looks as follows:
![image](https://private-user-images.githubusercontent.com/106056538/398681751-cd3d26e7-d6ed-4b27-8060-7619f78b71a8.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MzkzMjM0NzMsIm5iZiI6MTczOTMyMzE3MywicGF0aCI6Ii8xMDYwNTY1MzgvMzk4NjgxNzUxLWNkM2QyNmU3LWQ2ZWQtNGIyNy04MDYwLTc2MTlmNzhiNzFhOC5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjUwMjEyJTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI1MDIxMlQwMTE5MzNaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT1hZWY1NzRjZGIzN2NkYzZjNzM0MGE1ZWVlZTNkYzMyYmIzNTY1ODVjOWJiZDJhZTU1MDZiYWEzNjBlYjdkZWYzJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCJ9.d3qAA3nkhsIiPwaghVYYwM-obUl26o3dfituiZ-YqQ4)
The results seem excellent, and I only need to adjust a few contigs to achieve a nearly perfect phased genome. However, due to the low signal visibility in Juicebox, I am unable to complete this task.
As mentioned earlier, the merged genome is 2 GB in size, and the porec data is only about 25 GB. I wonder if the low data volume in phasing mode is causing the weak signal display in Juicebox, or if there is any way to enhance the signal visibility in Juicebox to help me adjust misassembled contigs. Alternatively, what amount of porec data would be required for this genome to display strong interaction signals in Juicebox
Thanks!
The text was updated successfully, but these errors were encountered: