1. Zhu S. The Loaches of the Subfamily Nemacheilinae in China (Cypriniformes: Cobitidae)。. Jiangsu Science & Technology Publishing House.; 1989.
2. Liu Z, Wen H, Hailer F, Dong F, Yang Z, Liu T, et al. Pseudogenization of Mc1r gene associated with transcriptional changes related to melanogenesis explains leucistic phenotypes in Oreonectes cavefish (Cypriniformes, Nemacheilidae). J Zool Syst Evol Res. 2019;900.0-909.
3. Deng H, Xiao N, Hou X, Zhou J. A new species of the genus Oreonectes (Cypriniformes: Nemacheilidae) from Guizhou, China. Zootaxa. 2016;143.0-150.
4. Parra G, Bradnam K, Korf I. CEGMA: a pipeline to accurately annotate core genes in eukaryotic genomes. Bioinformatics. 2007;1061–7.
5. Simão AF, Waterhouse MR, Ioannidis P, Kriventseva VE, Zdobnov ME. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics. 2015;3210.0-3212.0.
6. Cingolani P, Platts A, Wang LL, Coon M, Nguyen T, Wang L, et al. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly (Austin). 2012;80–92.
7. Huang WD, Sherman TB, Lempicki AR. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc. 2009;44.0-57.0.
8. Ogawa Y, Shiraki T, Asano Y, Muto A, Kawakami K, Suzuki Y, et al. Six6 and Six7 coordinately regulate expression of middle-wavelength opsins in zebrafish. Proc Natl Acad Sci U S A. 2019;
9. Pittlik S, Domingues S, Meyer A, Begemann G. Expression of zebrafish aldh1a3 (raldh3) and absence of aldh1a1 in teleosts. Gene Expr Patterns. 2008;141–7.
10. Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009;1754–60.
11. Xu H, Luo X, Qian J, Pang X, Song J, Qian G, et al. FastUniq: a fast de novo duplicates removal tool for paired short reads. PloS One. 2012;e52249–e52249.
12. Koren S, Walenz PB, Berlin K, Miller RJ, Bergman HN, Phillippy MA. Canu: scalable and accurate long-read assembly via adaptive -mer weighting and repeat separation. Genome Res. 2017;722–36.
13. Walker JB, Abeel T, Shea T, Priest M, Abouelliel A, Sakthikumar S, et al. Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PloS One. 2014;e112963–e112963.
14. Xu Z, Wang H. LTR_FINDER: an efficient tool for the prediction of full-length LTR retrotransposons. Nucleic Acids Res. 2007;W265-8.
15. Han Y, Wessler RS. MITE-Hunter: a program for discovering miniature inverted-repeat transposable elements from genomic sequences. Nucleic Acids Res. 2010;e199–e199.
16. Price LA, Jones CN, Pevzner AP. De novo identification of repeat families in large genomes. ISMB Suppl Bioinforma. 2005;351–8.
17. Edgar CR, Myers WE. PILER: identification and classification of genomic repeats. ISMB Suppl Bioinforma. 2005;152–8.
18. Wicker T, Sabot F, Hua-Van A, Bennetzen LJ, Capy P, Chalhoub B, et al. A unified classification system for eukaryotic transposable elements. Nat Rev Genet. 2007;973–82.
19. Jurka J, Kapitonov VV, Pavlicek A, Klonowski P, Kohany O, Walichiewicz J. Repbase Update, a database of eukaryotic repetitive elements. Cytogenet Genome Res. 2005;462–7.
20. Chen N. Using RepeatMasker to identify repetitive elements in genomic sequences. Curr Protoc Bioinforma. 2004;Unit 4.10-Unit 4.10.
21. Burge C, Karlin S. Prediction of complete gene structures in human genomic DNA. J Mol Biol. 1997;78–94.
22. Stanke M, Waack S. Gene prediction with a hidden Markov model and a new intron submodel. Bioinformatics. 2003;II215–25.
23. Majoros HW, Pertea M, Salzberg S. TigrScan and GlimmerHMM: two open source ab initio eukaryotic gene-finders. Bioinformatics. 2004;2878–9.
24. Blanco E, Parra G, Guigó R. Using geneid to identify genes. Curr Protoc Bioinforma. 2007;Unit 4.3-Unit 4.3.
25. Korf I. Gene finding in novel genomes. BMC Bioinformatics. 2004;59–59.
26. Keilwagen J, Wenk M, Erickson LJ, Schattat HM, Grau J, Hartung F. Using intron position conservation for homology-based gene prediction. Nucleic Acids Res. 2016;
27. Altschul FS, Gish W, Miller CW, Myers WE, Lipman JD. Basic local alignment search tool. J Mol Biol. 1997;403–10.
28. Campbell AM, Haas JB, Hamilton PJ, Mount MS, Buell RC. Comprehensive analysis of alternative splicing in rice and comparative analyses with Arabidopsis. BMC Genomics. 2006;327–327.
29. Kim D, Langmead B, Salzberg LS, Langmead B, Langmead B. HISAT: a fast spliced aligner with low memory requirements. Nat Methods. 2015;357-U121.
30. Pertea M, Pertea MG, Antonescu MC, Salzberg LS, Antonescu MC, Chang T-C, et al. StringTie enables improved reconstruction of a transcriptome from RNA-seq reads. Nat Biotechnol. 2015;290-+.
31. Haas B, Zimmermann B, Crusoe MR, MacManes M, Plessy C. TransDecoder (Find Coding Regions Within Transcripts). 2015. Available from: https://github.com/TransDecoder/TransDecoder.wiki.git
32. Tang S, Lomsadze A, Borodovsky M. Identification of protein coding regions in RNA transcripts. Nucleic Acids Res. 2015;e78–e78.
33. Haas JB, Salzberg LS, Zhu W, Pertea M, Allen EJ, Orvis J, et al. Automated eukaryotic gene structure annotation using EVidenceModeler and the Program to Assemble Spliced Alignments. Genome Biol. 2008;R7–R7.
34. Marchler-Bauer A, Lu S, Anderson BJ, Chitsaz F, Derbyshire KM, DeWeese-Scott C, et al. CDD: a Conserved Domain Database for the functional annotation of proteins. Nucleic Acids Res. 2011;D225-9.
35. Tatusov LR, Natale AD, Garkavtsev VI, Tatusova AT, Shankavaram TU, Rao SB, et al. The COG database: new developments in phylogenetic classification of proteins from complete genomes. Nucleic Acids Res. 2001;22–8.
36. Kanehisa M, Goto S. KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 2000;27–30.
37. Boeckmann B, Bairoch A, Apweiler R, Blatter M-C, Estreicher A, Gasteiger E, et al. The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003. Nucleic Acids Res. 2003;365–70.
38. Conesa A, Götz S, García-Gómez MJ, Terol J, Talón M, Robles M. Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research. Bioinformatics. 2005;3674–6.
39. Lowe MT, Eddy RS. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 1997;955–64.
40. Nawrocki PE, Eddy RS. Infernal 1.1: 100-fold faster RNA homology searches. Bioinformatics. 2013;2933–5.
41. Griffiths-Jones S, Moxon S, Marshall M, Khanna A, Eddy RS, Bateman A. Rfam: annotating non-coding RNAs in complete genomes. Nucleic Acids Res. 2005;D121-4.
42. Griffiths-Jones S, Grocock JR, Dongen van S, Bateman A, Enright JA. miRBase: microRNA sequences, targets and gene nomenclature. Nucleic Acids Res. 2006;D140-4.
43. Wheeler LD, Church MD, Lash EA, Leipe DD, Madden LT, Pontius UJ, et al. Database resources of the National Center for Biotechnology Information. Nucleic Acids Res. 2018;6–17.
44. Altschul FS, Madden LT, Schäffer AA, Zhang J, Zhang Z, Miller CW, et al. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Faseb J. 1998;A1326–A1326.
45. Fischer S, Brunk PB, Chen F, Gao X, Harb SO, Iodice BJ, et al. Using OrthoMCL to assign proteins to OrthoMCL-DB groups or to cluster proteomes into new ortholog groups. Curr Protoc Bioinforma. 2011;Unit 6.12.1-19.
46. Edgar CR. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004;1792–7.
47. Bie DT, Cristianini N, Demuth PJ, Hahn WM. CAFE: a computational tool for the study of gene family evolution. Bioinformatics. 2006;1269–71.
48. Heng L, Bob H, Alec W, Tim F, Jue R, Nils H, et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009;2078–9.
49. Picard toolkit. Broad Institute, GitHub repository. Broad Institute; 2019. Available from: https://broadinstitute.github.io/picard/
50. McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A, et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010;1297–303.
51. Stamatakis A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 2014;1312–3.
52. Harris R. Improved Pairwise Alignment of Genomic DNA [PhD Thesis]. ProQuest. 2007.