_PROBLEM CoEPrA-2006_Classification_004 _GROUP_NAME Reiji Teramoto _GROUP_MEMBERS Reiji Teramoto _ADDRESS Fundamental and Environmental Research Laboratories NEC Corporatopn _MODELING_PROCEDURE We used marginalized count kernels for amino acid sequences to generate new descriptors and suppor vector machines as classifier.[1,2]. To select descriptors and calibrate model, we performed 10-fold cross-validation via tuning types of marginalized count kernel, gaussian kernel parameter g and coefficient of slack variables C using the calibration dataset with setting class +1 to weight 5. We examined 6 types of marginalized count kernel as follows: 1. 1st-order maginalized count kernel with hidden markov models that hidden states is 3(abbreviation MCK1(h=3)). 2. 2nd-order maginalized count kernel with hidden markov models that hidden states is 3(abbreviation MCK2(h=3)). 3. 1st-order maginalized count kernel with hidden markov models that hidden states is 3(abbreviation MCK1(h=4)). 4. 2nd-order maginalized count kernel with hidden markov models that hidden states is 4(abbreviation MCK2(h=4)). 5. 1st-order maginalized count kernel with hidden markov models that hidden states is 5(abbreviation MCK1(h=5)). 6. 2nd-order maginalized count kernel with hidden markov models that hidden states is 5(abbreviation MCK2(h=5)). We examined gaussian kernel parameter g defined as follows: K(u,v) = exp(-g*|u-v|^2) We determined the final model based on the performance of 10-fold cross-validation as follows: MCK1(h=5): c=32.0, g=8.0, accuracy=73.8739%. 1. Tsuda K, Kin T, Asai K., Marginalized kernels for biological sequences, Bioinformatics. 2002;18 Suppl 1:S268-75. 2. Vapnik V, The Nature of Statistical Learning Theory, Springer, 1999. _PREDICTION Obj_00001 -1 Obj_00002 -1 Obj_00003 -1 Obj_00004 -1 Obj_00005 -1 Obj_00006 -1 Obj_00007 -1 Obj_00008 -1 Obj_00009 -1 Obj_00010 +1 Obj_00011 -1 Obj_00012 -1 Obj_00013 -1 Obj_00014 -1 Obj_00015 -1 Obj_00016 -1 Obj_00017 -1 Obj_00018 -1 Obj_00019 -1 Obj_00020 -1 Obj_00021 -1 Obj_00022 -1 Obj_00023 -1 Obj_00024 +1 Obj_00025 -1 Obj_00026 -1 Obj_00027 -1 Obj_00028 -1 Obj_00029 -1 Obj_00030 -1 Obj_00031 -1 Obj_00032 -1 Obj_00033 -1 Obj_00034 +1 Obj_00035 +1 Obj_00036 -1 Obj_00037 -1 Obj_00038 -1 Obj_00039 +1 Obj_00040 -1 Obj_00041 +1 Obj_00042 +1 Obj_00043 +1 Obj_00044 +1 Obj_00045 -1 Obj_00046 +1 Obj_00047 -1 Obj_00048 -1 Obj_00049 -1 Obj_00050 -1 Obj_00051 -1 Obj_00052 -1 Obj_00053 -1 Obj_00054 -1 Obj_00055 -1 Obj_00056 -1 Obj_00057 -1 Obj_00058 -1 Obj_00059 -1 Obj_00060 -1 Obj_00061 +1 Obj_00062 +1 Obj_00063 -1 Obj_00064 +1 Obj_00065 -1 Obj_00066 -1 Obj_00067 -1 Obj_00068 -1 Obj_00069 +1 Obj_00070 -1 Obj_00071 +1 Obj_00072 -1 Obj_00073 -1 Obj_00074 -1 Obj_00075 -1 Obj_00076 -1 Obj_00077 -1 Obj_00078 -1 Obj_00079 -1 Obj_00080 +1 Obj_00081 -1 Obj_00082 -1 Obj_00083 -1 Obj_00084 -1 Obj_00085 -1 Obj_00086 -1 Obj_00087 -1 Obj_00088 -1 Obj_00089 -1 Obj_00090 +1 Obj_00091 -1 Obj_00092 +1 Obj_00093 +1 Obj_00094 -1 Obj_00095 -1 Obj_00096 -1 Obj_00097 -1 Obj_00098 -1 Obj_00099 -1 Obj_00100 -1 Obj_00101 +1 Obj_00102 -1 Obj_00103 -1 Obj_00104 -1 Obj_00105 -1 Obj_00106 -1 Obj_00107 -1 Obj_00108 -1 Obj_00109 -1 Obj_00110 -1 Obj_00111 -1