📄 retrieveseq.txt
字号:
*** GENOME EXPLORER HELP FILE ***To provide help on using the GUI, and information about how the programs runContents1) Outline of Function2) Parameters loaded from the .inf file (settings menu)3) The User Interface4) Underlying Method--- RETRIEVE SEQ ------ 1) Outline of Function ---To retrieve sequences from EITHER a fastafile, OR a blastable database,using description line searches / fastacmd (accession id search), and write them to a fasta fileTo search a blastable database a) the database MUST be indexed b) searches performed by fastacmd, so only exact matches to accession id returnedTo search a fasta file a) convert to a blastable database, then see conditions above b) search for exact matches to the entire description line c) search for matches to accession id (the part of the description line before the first space character) d) search for substring matches anywhere within the description line--- 2) Parameters loaded from the .inf file (settings menu) ---retrieveSeqInDir directory in which to open input file chooserretrieveSeqOutDir default directory in which to write output filesblastDatabaseDir directory in which to open database file chooserretrieveSeqOutfileExt extension for output files (.fasta)blastTempDbFilename name for temporary database if one needs to be written (dbtemp)blastTempDbDir directory in which to write temporary database fastacmdExePath path to fastacmd executableformatdbExePath path to formatdb executablefastacmdTempFilename name (not path) for fastacmd temp file (fastacmd.tmp)fastacmdTempDir directory in which to write fastacmd temp file--- 3) The User Interface ---select database/fastafile Use the browse button to select file - that determines the type of file selected and enables other components accordinglyselect blast output directory Use the browse button to select a directory to which to write all output filesoutfile basename (or tick checkbox) Enter a basename for outfiles - they will then be numbered incrementally and have an extension added. OR Tick the checkbox to use the infile (the file of accession ids) name as the outfile basename, and just add an extension. This option is not available when entering the query as a string.select whether to search for accession ids from a string or from files Which radio button is selected effect which other components are enabled. for example, if ids as string is selected, the file chooser is disabled.select files containing accession ids to search for use the add / remove buttons to add / remove files to / from the list box. all files listed when 'ok' is hit will be runenter a comma separated list of accession ids to search for if searching a database these MUST be accession ids. If searching a fasta file (as a fasta file) search strings can mach to any part of the description line, depending on the search option chosenconvert the fasta file to be searched to a blastable database, then use fastacmd converts the fasta file before doing the search - can be quicker for large files but confines searches to accession id onlyselect type of comparison only an option if searching against a fasta file a) entire name: match the entire description line b) accession ID: match all characters up to the first white space character in the description line c) substring within name: match to a substring within the description linedb/fastafile contains PROTEIN sequences tick the box if you're searching against file/database of proteinsif fastacmd search, get duplicate sequences with dame id one of the command line arguments for fastacmdenter fasta line length (for output file) the number of characters on a single line of fasta output (80)--- 4) Underlying Method ---If search is vs a blastable database - uses fastacmdIf search file is in fasta format, but the user requests it, it is converted toa blastable database using formatdb, and then searched using fastacmdIf search file is a fasta file, it is loaded into memory in serviceable chunks, and each description line searched using String.substring and String.equals methodssupplied with Java.
⌨️ 快捷键说明
复制代码
Ctrl + C
搜索代码
Ctrl + F
全屏模式
F11
切换主题
Ctrl + Shift + D
显示快捷键
?
增大字号
Ctrl + =
减小字号
Ctrl + -