Identification of informative genes and pathways using an improved penalized support vector machine with a weighting scheme