As with word similarity and analogy tasks, we can also apply pretrained word vectors to sentiment analysis. Since the IMDb review dataset from Section 16.1 is not very big, using text representations pretrained on large-scale corpora may reduce overfitting of the model. As a concrete example illustrated in Fig. 16.2.1, we will represent each token using the pretrained GloVe model, and feed these token representations into a multilayer bidirectional RNN to obtain the text sequence representation, which will be transformed into sentiment analysis outputs (Maas et al., 2011). For the same downstream application, we will consider a different architectural choice later.
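The code below assumes the imports and the IMDb data iterators from Section 16.1. A minimal setup sketch using the d2l.load_data_imdb helper (the batch size of 64 is an assumption):

import torch
from torch import nn
from d2l import torch as d2l

batch_size = 64
train_iter, test_iter, vocab = d2l.load_data_imdb(batch_size)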
16.2.1. Representing Single Text with RNNs
In text classification tasks, such as sentiment analysis, a varying-length text sequence will be transformed into fixed-length categories. In the following BiRNN class, while each token of a text sequence gets its individual pretrained GloVe representation via the embedding layer (self.embedding), the entire sequence is encoded by a bidirectional RNN (self.encoder). More concretely, the hidden states (at the last layer) of the bidirectional LSTM at the initial and final time steps are concatenated as the representation of the text sequence. This single text representation is then transformed into output categories by a fully connected layer (self.decoder) with two outputs ("positive" and "negative"). Both the PyTorch and MXNet Gluon implementations are shown below.
class BiRNN(nn.Module):
    def __init__(self, vocab_size, embed_size, num_hiddens,
                 num_layers, **kwargs):
        super(BiRNN, self).__init__(**kwargs)
        self.embedding = nn.Embedding(vocab_size, embed_size)
        # Set `bidirectional` to True to get a bidirectional RNN
        self.encoder = nn.LSTM(embed_size, num_hiddens, num_layers=num_layers,
                               bidirectional=True)
        self.decoder = nn.Linear(4 * num_hiddens, 2)

    def forward(self, inputs):
        # The shape of `inputs` is (batch size, no. of time steps). Because
        # LSTM requires its input's first dimension to be the temporal
        # dimension, the input is transposed before obtaining token
        # representations. The output shape is (no. of time steps, batch size,
        # word vector dimension)
        embeddings = self.embedding(inputs.T)
        self.encoder.flatten_parameters()
        # Returns hidden states of the last hidden layer at different time
        # steps. The shape of `outputs` is (no. of time steps, batch size,
        # 2 * no. of hidden units)
        outputs, _ = self.encoder(embeddings)
        # Concatenate the hidden states at the initial and final time steps as
        # the input of the fully connected layer. Its shape is (batch size,
        # 4 * no. of hidden units)
        encoding = torch.cat((outputs[0], outputs[-1]), dim=1)
        outs = self.decoder(encoding)
        return outs
# The same model implemented with MXNet Gluon (`nn`, `rnn`, and `np` from `mxnet`)
class BiRNN(nn.Block):
    def __init__(self, vocab_size, embed_size, num_hiddens,
                 num_layers, **kwargs):
        super(BiRNN, self).__init__(**kwargs)
        self.embedding = nn.Embedding(vocab_size, embed_size)
        # Set `bidirectional` to True to get a bidirectional RNN
        self.encoder = rnn.LSTM(num_hiddens, num_layers=num_layers,
                                bidirectional=True, input_size=embed_size)
        self.decoder = nn.Dense(2)

    def forward(self, inputs):
        # The shape of `inputs` is (batch size, no. of time steps). Because
        # LSTM requires its input's first dimension to be the temporal
        # dimension, the input is transposed before obtaining token
        # representations. The output shape is (no. of time steps, batch size,
        # word vector dimension)
        embeddings = self.embedding(inputs.T)
        # Returns hidden states of the last hidden layer at different time
        # steps. The shape of `outputs` is (no. of time steps, batch size,
        # 2 * no. of hidden units)
        outputs = self.encoder(embeddings)
        # Concatenate the hidden states at the initial and final time steps as
        # the input of the fully connected layer. Its shape is (batch size,
        # 4 * no. of hidden units)
        encoding = np.concatenate((outputs[0], outputs[-1]), axis=1)
        outs = self.decoder(encoding)
        return outs
Let's construct a bidirectional RNN with two hidden layers to represent single text for sentiment analysis.
embed_size, num_hiddens, num_layers, devices = 100, 100, 2, d2l.try_all_gpus()
net = BiRNN(len(vocab), embed_size, num_hiddens, num_layers)

def init_weights(module):
    if type(module) == nn.Linear:
        nn.init.xavier_uniform_(module.weight)
    if type(module) == nn.LSTM:
        for param in module._flat_weights_names:
            if "weight" in param:
                nn.init.xavier_uniform_(module._parameters[param])

net.apply(init_weights);
16.2.2. Loading Pretrained Word Vectors
Below we load the pretrained 100-dimensional (needs to be consistent with embed_size) GloVe embeddings for tokens in the vocabulary.
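A minimal sketch of this step, assuming the d2l.TokenEmbedding helper introduced in the word embedding chapters:

glove_embedding = d2l.TokenEmbedding('glove.6b.100d')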
Downloading ../data/glove.6B.100d.zip from http://d2l-data.s3-accelerate.amazonaws.com/glove.6B.100d.zip...
Print the shape of the vectors for all the tokens in the vocabulary.
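For example, one might check the shape as follows (assuming glove_embedding and vocab from above; indexing a TokenEmbedding with a list of tokens returns the corresponding vectors):

embeds = glove_embedding[vocab.idx_to_token]
embeds.shape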
We use these pretrained word vectors to represent tokens in the reviews, and we will not update these vectors during training.
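A sketch of how this can be done in PyTorch, assuming embeds from the previous step: copy the GloVe vectors into the embedding layer and freeze its weights.

net.embedding.weight.data.copy_(embeds)
net.embedding.weight.requires_grad = False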
16.2.3. Training and Evaluating the Model
Now we can train the bidirectional RNN for sentiment analysis.
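A minimal training sketch in PyTorch, assuming the d2l.train_ch13 utility and the data iterators defined earlier; the learning rate and number of epochs are assumptions:

lr, num_epochs = 0.01, 5
trainer = torch.optim.Adam(net.parameters(), lr=lr)
loss = nn.CrossEntropyLoss(reduction="none")
d2l.train_ch13(net, train_iter, test_iter, loss, trainer, num_epochs, devices)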
loss 0.311, train acc 0.872, test acc 0.850
574.5 examples/sec on [device(type='cuda', index=0), device(type='cuda', index=1)]
loss 0.428, train acc 0.806, test acc 0.791
488.5 examples/sec on [gpu(0), gpu(1)]
We define the following function to predict the sentiment of a text sequence using the trained model net.
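A sketch of such a function in PyTorch, assuming that vocab maps a list of tokens to indices as in Section 16.1 and that label 1 denotes "positive":

def predict_sentiment(net, vocab, sequence):
    """Predict the sentiment of a text sequence."""
    # Convert the tokenized sequence into a row vector of token indices
    sequence = torch.tensor(vocab[sequence.split()], device=d2l.try_gpu())
    # The model expects inputs of shape (batch size, no. of time steps)
    label = torch.argmax(net(sequence.reshape(1, -1)), dim=1)
    return 'positive' if label == 1 else 'negative'

For example, predict_sentiment(net, vocab, 'this movie is so great') returns the predicted sentiment for that review.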