predict-2nd	
	part of the SAM-T02 suite

Tue Feb 19 15:42:15 PST 2008 Kevin Karplus

This source code is being released "as-is" for researchers who wish to
duplicate our work on protein local structure prediction.  Since we
have never received any funding for making an easily-installed,
user-friendly system, this releases is about all we can afford to do.

Since this directory is also our working directory for recent changes,
not a cleaned-up frozen snapshot of the code, it most likely contains
a lot of junk, as well as the essential parts of the code.

In addition to the code in predict-2nd, you will also need the
libraries in ../ultimate


Mon Jun 13 03:09:07 PDT 2005 Kevin Karplus

The following "Makefile" fragment, taken from the SAM-T02 server,
gives an example of running predict-2nd given an a2m file and an
already trained network.

############################################################
ROOTDIR=/projects/compbio/experiments/protein-predict/SAM_T02
HTMLDIR = /projects/compbio/.html/SAM_T02
WORKDIR= ${HTMLDIR}/results/${QUERYDIR}
BASEURL = http://www.soe.ucsc.edu/projects/compbio/SAM_T02/results/${QUERYDIR}
BIN = ${ROOTDIR}/bin_freeze
BIN-PREDICT2ND = ${BIN}/predict_2nd
PREDICT-2ND = ${BIN-PREDICT2ND}/predict-2nd
STR-NET = ${BIN-PREDICT2ND}/networks/t2k-5651-IDaaHr-5-15-7-15-9-15-13-str-seeded.net
%.t2k.str %.t2k.str.rdb %.t2k.str.seq : %.t2k-thin90.a2m.gz ${STR-NET}
	echo ReadAlphabet /projects/compbio/lib/alphabet/str.alphabet > tmp.script
	echo ReadNeuralNet ${STR-NET} >> tmp.script
	echo ReadA2M $< >> tmp.script
	echo PrintPrediction $*.t2k.str  >> tmp.script
	echo PrintPredictionFasta $*.t2k.str.seq >> tmp.script
	echo PrintRDB $*.t2k.str.rdb >> tmp.script
	${PREDICT-2ND} < tmp.script
	rm tmp.script
	${SCRIPTSDIR}/add_summary_html "STR structure prediction CASP format" ${BASEURL}/$*.t2k.str $*.t2k.str >> ${WORKDIR}/summary.html
	${SCRIPTSDIR}/add_summary_html "STR structure prediction RDB format" ${BASEURL}/$*.t2k.str.rdb $*.t2k.str.rdb >> ${WORKDIR}/summary.html
	${SCRIPTSDIR}/add_summary_html "STR EBGHTL structure prediction sequence format" ${BASEURL}/$*.t2k.str.seq $*.t2k.str.seq >> ${WORKDIR}/summary.html
############################################################

Currently, trained networks (and the scripts to train them) can be
found in /projects/compbio/usr/karplus/predict-2nd/testing/
roughly sorted into directories according to the type of alphabet
being predicted.

The alphabet files can be found in /projects/compbio/lib/alphabet/
Several alphabets may appear in the same file, which makes creating
<<<<<<< README
the make files needlessly difficult, since the mapping form alphabet
name to alphabet file is not obvious.  Changing this would require
making several changes in pipelines and makefiles that use
perdict-2nd, which we have not had time to do.

Sun Jun  1 09:04:32 PDT 2008 Kevin Karplus

alphabets/ subdirectory added to pre-release version on web, giving
access to all our alphabets.

Because std.alphabet location is hardwired in ultimate/src/Alphabet/Makefile,
that file needs to be changed before compiling on a different system.
If you need to run the program without recompiling, you can use
command-line option "-noalph" to skip reading in alphabets from
standard locations.  You will then need explicit ReadAlphabet commands
in your scripts, even for the standard alphabets.
=======
the make files needlessly difficult, since the mapping from alphabet
name to alphabet file is not obvious.
>>>>>>> 1.6


Thu Mar  5 16:04:02 PST 2009 Kevin Karplus

Note: the "zlib" package is needed by predict-2nd and ultimate, but is
not included here.  Look for it at
	http://www.zlib.net