* save/load automatically store large arrays in separate files + allow mmap
* allow .gz/.bz2 corpus filenames => transparently (de)compressed I/O
* CBOW model for word2vec (Sébastien Jean, 176)
* new API for storing corpus metadata (Joseph Chang, 169)
* new LdaMallet class = train LDA using wrapped Mallet
* new MalletCorpus class for corpora in Mallet format (Christopher Corley, 179)
* better Wikipedia article parsing (Joseph Chang, 170)
* word2vec load_word2vec_format uses less memory (Yves Raimond, 164)
* load/store vocabulary files for word2vec C format (Yves Raimond, 172)
* HDP estimation on new documents (Elliot Kulakow, 153)
* store labels in SvmLight corpus (Ritesh, 152)
* fix word2vec binary load on Windows (Stephanus van Schalkwyk)
* replace numpy.linalg.svd with scipy.linalg.svd for more stability (Sven Döring, 159)
* parametrize LDA constructor (Christopher Corley, 174)
* steps toward py3k compatibility (Lars Buitinck, 154)
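
The `.gz`/`.bz2` item above amounts to picking a file opener from the filename suffix, so that reading and writing code stays the same for plain and compressed corpora. The sketch below illustrates that idea using only the Python standard library. It is not the library's actual implementation: `open_by_extension` and `roundtrip` are hypothetical names introduced here for illustration.

```python
import bz2
import gzip
import os
import tempfile


def open_by_extension(path, mode="rt", encoding="utf-8"):
    """Hypothetical helper: choose an opener from the filename suffix."""
    if path.endswith(".gz"):
        return gzip.open(path, mode, encoding=encoding)
    if path.endswith(".bz2"):
        return bz2.open(path, mode, encoding=encoding)
    # Anything else is treated as an uncompressed text file.
    return open(path, mode, encoding=encoding)


def roundtrip(lines, suffix):
    """Write lines to a temp file with the given suffix and read them back."""
    fd, path = tempfile.mkstemp(suffix=suffix)
    os.close(fd)
    try:
        with open_by_extension(path, "wt") as fout:
            fout.write("\n".join(lines) + "\n")
        with open_by_extension(path, "rt") as fin:
            return [line.rstrip("\n") for line in fin]
    finally:
        os.remove(path)
```

Calling `roundtrip(docs, ".gz")`, `roundtrip(docs, ".bz2")` and `roundtrip(docs, ".txt")` should return the same list of lines each time, which is the sense in which the compression is "transparent" to the caller.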