-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
NA is coming up as a gene name #51
Comments
@cnobles for what dataset(patient) and where in the report? |
If you want to test, check patient "p04409-10". NA appears in the barcharts, sharing heatmap, and wordclouds. |
indeed it is, I will work on it. |
we have reads mapped to chrM but refSeq gene tables does not have chrM so when getNearestFeature is called for solutions: |
SELECT DISTINCT samples.miseqid FROM All the runs containing sites which match to chrM sites. |
SELECT DISTINCT samples.miseqid, COUNT(miseqid) AS chrM_Hits FROM samples INNER JOIN sites ON samples.sampleID = sites.sampleID WHERE chr = "chrM" GROUP BY miseqid How many chrM hits per run. We should focus on grabbing the reads from the February runs. |
Is it biologically possible to have integration events in the mitochondria? If not where do these "sites" come from? If so, NA or Inf seem like plausible choices for constructing a dataset with filtering done downstream. |
There currently isn't much support for mitochondrial integration, if any. I'm currently looking into where these reads are coming from and what there origins may be. I'll be able to update everyone soon on their legitimacy, or if they are contaminants / artifacts. |
Update: Data suggests a large part of the problem is the LTRbit filtering is too permissive. We looked back in the intSiteLogic.R code and found which line should be changed. We are currently adjusting the parameters to be more stringent and rerunning some of the runs to check performance. |
Another part of the problem is reads that map to chrM but also map (at slightly lower quality) to chr#. This can occur as the DNA of mitochondria has been slowly assimilated into the eukaryotic genome over time, and therefore there are regions of the human genome which highly resemble regions of the mitochondrial genome. |
No description provided.
The text was updated successfully, but these errors were encountered: