A short script to use the nontarget pattern.match function to print useful information about clusters.
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.

88 lines
2.3 KiB

3 years ago
3 years ago
3 years ago
3 years ago
3 years ago
3 years ago
  1. # Specifications for the script
  2. **Use the Nontarget**
  3. - Clusters should have large enough number of fragments (3+ maybe)
  4. - nontarget has a pattern match function.
  5. - Check for chlorinated compounds (has n+2, n+4, ...)
  6. - GPL License (good)
  7. - Script to get output for all of the results
  8. - tabbed-deliminated text file
  9. - Which clusters contain chlorinated compounds, how many fragments, what other
  10. isotopes.
  11. ## format
  12. - Readable by R.
  13. - Manipulatable by Tidyverse
  14. - Include it in script?
  15. # Additional Notes
  16. - Independent of the ordering (it might get corrupted)
  17. - Just search for chlorinated/brominated compounds.
  18. - Use intensity + relative abundances that `isos` shows.
  19. - usually, use_charges will be 1
  20. - Headers to use in script.
  21. - Packages to do things in parallel (multiple threads)
  22. - be able to configure the number of cores or threads used.
  23. - make it a requirement to be tab-deliminated.
  24. - eliminate clusters with only m/z.
  25. # Notes
  26. - cutint represents high enough intensity.
  27. - mztol is in terms of ppm.
  28. - mzfrac is in terms of absolute.
  29. - PPM = true always
  30. - First pass - (identify all of the fragments where chlorinated/brominated
  31. compounds are found and carbon)
  32. - If isotopes exist, which ones have chlorines/bromines...
  33. - return a data frame.
  34. - No statistical testing required (check if it is chlorine or bromine)
  35. - Isotopes to check (Carbon, Sulfur, Chlorine, Bromine all have n+2) (check make.isos object)
  36. - Tweak parameters (but also put them into a slideshow/record, see how good the results are).
  37. - **set all rules to false**
  38. ## Isotopes
  39. - These isotopes will tend to have different intensities
  40. - Part of the reasoning can be attributed to differences.
  41. - Forseeable future, adapt to different isotopes.
  42. - Capture multiple outputs of different isotopes.
  43. ## Data Summary
  44. Per row:
  45. - Fragment number
  46. - Number of peaks
  47. - top 5 (configurable) most intense m/z values.
  48. - Averaged time output
  49. ### Derivations for above
  50. - Fragment number - trivial
  51. - Number of each isotope (check group counts per isotope)
  52. - most intense - loop through the fragment and isolate. (Preserve all three categories)
  53. - Averaged time output.
  54. ## Output num 2
  55. - each cluster of elements, dump into a file (cluster number).
  56. # Test Notes
  57. - mztol seems to have a big impact on chlorine/bromine positives.