Writing an R function to read FASTA-formatted files (CC289)

Watch along as Pat develops a function for reading FASTA-formatted files, which are commonly used to store DNA sequence data. He uses test-driven development (TDD) to build and refactor the function until it suits the project's needs. Along the way, he creates temporary files with the tempfile function and uses a variety of functions from the stringi package, which stringr is built on. This episode is part of an ongoing effort to develop an R package that implements the naive Bayesian classifier.
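
For readers who want a feel for the approach before watching, here is a minimal sketch of a FASTA reader checked against a temporary file. It is not the function written in the episode: the name read_fasta_sketch, the returned columns (id, comment, sequence), and the test data are all assumptions made for illustration. It assumes the stringi package is installed and uses base R's stopifnot in place of testthat expectations.

```r
library(stringi)

# Hypothetical sketch, not the episode's read_fasta: returns a data frame
# with one row per record and columns id, comment, and sequence.
read_fasta_sketch <- function(path) {
  lines <- readLines(path, warn = FALSE)
  lines <- lines[!stri_isempty(stri_trim_both(lines))]  # drop blank lines

  is_header <- stri_startswith_fixed(lines, ">")
  headers <- stri_sub(lines[is_header], from = 2)       # strip leading ">"

  # The id is the first run of non-whitespace; the rest is the comment
  id <- stri_extract_first_regex(headers, "^\\S+")
  comment <- stri_trim_both(stri_replace_first_regex(headers, "^\\S+", ""))

  # Assign each sequence line to the most recent header, then join the lines
  record <- cumsum(is_header)
  groups <- split(
    lines[!is_header],
    factor(record[!is_header], levels = seq_along(headers))
  )
  sequence <- vapply(groups, stri_flatten, character(1))
  sequence <- stri_replace_all_regex(unname(sequence), "\\s+", "")

  data.frame(id = id, comment = comment, sequence = sequence,
             stringsAsFactors = FALSE)
}

# Write a small FASTA file to a temporary location, read it back, clean up
tmp <- tempfile(fileext = ".fasta")
writeLines(
  c(">seqA first test sequence", "ATGC", "GGTA", ">seqB", "TTAACC"),
  tmp
)
result <- read_fasta_sketch(tmp)
unlink(tmp)

# TDD-style checks: names, comments, and multi-line sequences
stopifnot(
  identical(result$id, c("seqA", "seqB")),
  identical(result$comment, c("first test sequence", "")),
  identical(result$sequence, c("ATGCGGTA", "TTAACC"))
)
```

In a package, the stopifnot checks would normally live in a testthat test file, and the temporary file keeps the test from depending on anything outside the test itself.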

If you want to get a physical copy of R Packages: https://amzn.to/43pMR8L
If you want a free, online version of R Packages: https://r-pkgs.org/

You can find my blog post for this episode at https://www.riffomonas.org/code_club/....

Check out the GitHub repository at the:
* Beginning of the episode: https://github.com/riffomonas/phyloty...
* End of the episode: https://github.com/riffomonas/phyloty...


#rstats #paste #paste0 #refactor #testthat #tdd #microbenchmark #vectors #rdp #16S #classification #classifier #microbialecology #microbiome

Support Riffomonas by becoming a Patreon member!
  / riffomonas  

Want more practice on the concepts covered in Code Club? You can sign up for my weekly newsletter at https://shop.riffomonas.org/youtube to get practice problems, tips, and insights.

If you're interested in purchasing a video workshop be sure to check out https://riffomonas.org/workshops/

You can also find complete tutorials for learning R with the tidyverse using...
Microbial ecology data: https://www.riffomonas.org/minimalR/
General data: https://www.riffomonas.org/generalR/



0:00 Introduction
7:19 Working with temporary files
10:36 Basic version of read_fasta
15:37 Parsing sequence names when comments are present
21:41 Parsing comments from FASTA header line
24:01 Reading in multiple lines of sequence data
37:08 Using read_fasta in vignette
