⭐ 欢迎来到虫虫下载站! | 📦 资源下载 📁 资源专辑 ℹ️ 关于我们
⭐ 虫虫下载站

📄 readme

📁 C-package of "Long Short-Term Memory" for Protein classification
💻
字号:
We offer a first C-package of "Long Short-Term Memory" for Proteinclassification (LSTM_protein).:: LicenseThis programm is freely available for academic, non-profit users andopen-source developers under the GNU General Public License (GPL).Commercial users please contact secretary@bioinf.jku.at to get acommercial license.This program is distributed in the hope that it will be useful, but WITHOUTANY WARRANTY; without even the implied warranty of MERCHANTABILITYor FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public Licensefor more details. You should have received a copy of the GNU General PublicLicense along with this program; if not, write to the Free Software Foundation, Inc.,51 Franklin Street, Fifth Floor, Boston, MA 02110-1301, USA.:: CitationPlease cite:Sepp Hochreiter, Martin Heusel, and Klaus Obermayer."Fast Model-based Protein Homology Detection without Alignment."Bioinformatics 2007; doi: 10.1093/bioinformatics/btm247.:: UsageThis release of the LSTM classifier for protein sequences contains the sourcesfor compiling the LSTM binary as well three directories for three experiments.:: SCOP 1.53 benchmarkThe 'SCOP1.53Experiment' directory holds the complete SCOP benchmark accordingly to'Fast Model-based Protein Homology Detection without Alignment', Bioinformatics 2007,S. Hochreiter, M. Heusel and K. Obermayer.To run the SCOP benchmark the datasets it's necessary to download the archive'datasetsSCOP1.53.tar.gz' or 'datasetsSCOP1.53.tar.bz2' and unpack it in the'SCOP1.53Experiment' directory.:: Quickstart experimentThe 'quickstartExperiment' holds a quickstart experiment where one can get a quickoverview of the LSTM by running one classification experiment with one class ofthe SCOP benchmark mentioned above.:: Individual experimentsIn the 'myExperiment' directory one can assemble an individual experiment withown sequences. Note, it's necessary to have positive and negative examples fortraining. Positive and negative examples are splitted into training and test sequences.To perform the experiments there are individual READMEs in the respecitive folders.:: General usageTo use the LSTM in general the usage is./lstm -c parameterfile [ -w weightfile [ - test ] ]:: Input formatDatasets should be in FASTA format. There are examples in the datasets folders.:: Parameter file / config fileThe parameter file holds the parameters like number of memory cells, biases,learning rate, number of epochs and the size and the locations of thedatasets. See the example config files in e.g. 'quickstartExperiment' likelstmpars_mem14.ws11.txt etc. See also README.parameter for a more detaileddescription of the parameter file.The example config files in the SCOP benchmark directory are template filesfor the runtrain.sh script where the location to the datasets are inserted automaticallyby a perl call. To use the config files individually it's necessary to replacethe dataset patterns by a valid path.:: Loading weight matricesA previously trained weight matrix can be loaded with the parameter -w.LSTM writes out periodically a weight matrix, see the config file for theinterval.:: Performing a TestIf a weight file is given one can set the parameter -test so that only atest is performed on a trained weight matrix without any training.e.g../lstm -c lstmpars.mem12.ws9.txt -w weight.mat -testThe location of the test datasets can be configured in the parameter file.:: ResultsThe MSE after each epoch is printed out on STDERR as well the percent offalse negatives and false positives. For the test sequences ROC and ROC50is reported. A more informative result is appended to the file out.txt.The last entry in the out.txt after training or test shows the final resultfor training and test.Tests are done periodically while training. The number of epochs aftera test should be performed can be configured in the parameter file.

⌨️ 快捷键说明

复制代码 Ctrl + C
搜索代码 Ctrl + F
全屏模式 F11
切换主题 Ctrl + Shift + D
显示快捷键 ?
增大字号 Ctrl + =
减小字号 Ctrl + -