Description:
Regression ensemble of sensillum SST with 4 individual models (I5, I11, I16 and I18) used for prediction of spike amplitude of the extracellularly record of ORN response potentials of the antennal trichoid sensilla to Cx. quinquefasciatus repellents.

The training was performed with Linear Regression algorithm as default in Weka 3.9.4 with a 10-fold cross-validation. Models I5, I11, I16 and I18 were developed by using 7, 10, 7, and 10 QuBiLS-MIDAS descriptors, respectively, namely:

S_F_AB_nCi_2_M14_NS2_n_T_LGL[2-3]_v_MID
S_B_AB_nCi_2_M5_NS4_T_LGL[2-3]_a-h_MID
TS[2]_I50_F_AB_nCi_2_M15_MP1_T_KA_alk_MID
ES_I50_F_AB_nCi_2_M1_SS0_T_LGL[2-3]_e_MID
N2_F_AB_nCi_2_M12_NS5_T_LGL[1-2]_a_MID
VC_TrC_AB_nCi_3_M20(M16)_SS3_T_LGTP[12-13]_p_MID
K_F_AB_nCi_2_M10_SS7_o_T_LGP[2]_a_MID
MN_Q_AB_nCi_2_M10_SS2_T_LGP[2]_v_MID
AC[1]_S_F_AB_nCi_2_M10_MP0_T_LGP[2]_c_MID
ES_K_B_AB_nCi_2_M3_NS0_o_T_LGP[2]_v-h_MID
ES_S_B_AB_nCi_2_M3_NS6_n_T_LGP[2]_a-psa_MID
K_B_AB_nCi_2_M3_NS7_T_LGL[2-3]_r-v_MID
RA_TrF_AB_nCi_3_M21(M13)_MP6_T_LG3L[2-3]_LGL[2-3]_p_MID
ES_I50_Q_AB_nCi_2_M1_SS1_n_T_LGP[1]_e_MID
ES_S_B_AB_nCi_2_M3_SS6_T_LGP[2]_a-e_MID
AC[4]_S_B_AB_nCi_2_M5_NS6_T_LGL[1-2]_m-s_MID

Training set:
50 compounds extracted from the Liu, et al., 2013 original experimental research constitute the training set 10.1016/j.jinsphys.2013.08.016

Performance:
For a 10-fold cross-validation, the statistical parameters (performance without applicability domain) are R = 0.9837, MAE = 1.1587, RMSE = 2.7192, RAE = 10.5179 %, and RRSE = 17.8304 %.

Classification Breakpoint:
The breakpoint for sensillum SST is 43 spikes/s. Values greater than or equal to the breakpoint will elicit a response in this specific sensillum. Lower values represent certain actions occurring, however, they are not enough to activate the sensillum response.

Reference:
Liu et al. Olfactory responses of the antennal trichoid sensilla to chemical repellents in the mosquito, Culex quinquefasciatus. 2013, 59(11), 1169-1177. DOI: 10.1016/j.jinsphys.2013.08.016