Papers

Distributed Systems

Google DistBelief: Large Scale Distributed Deep Networks

Style Transfer

LSTM

RNN-Based LSTM Architectures: Tackling Large-Vocabulary Speech Recognition

NLP

1. Model of Chinese Words Rough Segmentation Based on N-Shortest-Paths Method

2. Automatic Recognition of Chinese Personal Name Based on Role Tagging

L2R

Fundamentals of Machine Learning and Learning to Rank

Conditional Random Fields (CRF)

Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data

1. GBDT

Friedman, Greedy Function Approximation: A Gradient Boosting Machine

Tianqi Chen, Introduction to Boosted Trees

KDD 2016: XGBoost: A Scalable Tree Boosting System

2. CTR Smoothing

Click-Through Rate Estimation for Rare Events in Online Advertising

3. Neural Networks

word2vec

Sentence2Vec

From Word Embeddings To Document Distances

Distributed Representations of Sentences and Documents

Skip-Thought Vectors

Hierarchical Softmax

LDA

Recommender Systems

LambdaMART