BERT模型源码解析 _生活百科

BERT模型源码解析
modeling.py
目录
属性
类
class BertConfig(object)   BERT模型配置参数类
class BertModel(object)   BERT模型类
函数
def gelu(x)  格鲁激活函数
def get_activation(activation_string) 通过名称获取激活函数
def get_assignment_map_from_checkpoint 读取检查点函数
def dropout(input_tensor, dropout_prob) 丢弃函数，按一定比例丢弃权重数据
def layer_norm(input_tensor, name=None) 数据标准化
def layer_norm_and_dropout 先标准化，再丢弃
def create_initializer(initializer_range=0.02) 数据初始化
def embedding_lookup 嵌入查找函数
def embedding_postprocessor 嵌入处理函数
def create_attention_mask_from_input_mask 创建注意力掩码
def attention_layer 注意力层处理函数
def transformer_model transformer模型
def get_shape_list 获取张量的形状参数列表
def reshape_to_matrix(input_tensor) 将张量转换为二维矩阵
def reshape_from_matrix(output_tensor, orig_shape_list) 将二维张量转换为指定维数
def assert_rank(tensor, expected_rank, name=None) 断言张量的维数
源码
许可信息
# coding=utf-8 编码使用utf-8
# Copyright 2018 The Google AI Language Team Authors.版权术语谷歌语言团队的作者
#
# Licensed under the Apache License, Version 2.0 (the "License");根据Apache许可证进行许可
# you may not use this file except in compliance with the License.
如不符合许可证的规定，则不可使用本文件
# You may obtain a copy of the License at 可以通过下面的网址获取许可证副本
#
#     http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
"""The main BERT model and related functions."""
导入依赖
from __future__ import absolute_import
from __future__ import division
from __future__ import print_function
import collections
import copy
import json
import math
import re
import numpy as np
import six
import tensorflow as tf
模型配置
构造函数
参数说明
class BertConfig(object):
"""Configuration for `BertModel`."""对BERT模型进行参数配置
def __init__(self,
vocab_size,
hidden_size=768,
num_hidden_layers=12,
num_attention_heads=12,
intermediate_size=3072,
hidden_act="gelu",
hidden_dropout_prob=0.1,
attention_probs_dropout_prob=0.1,
max_position_embeddings=512,
type_vocab_size=16,
initializer_range=0.02):
"""Constructs BertConfig.构造函数
Args:参数说明
vocab_size: Vocabulary size of `inputs_ids` in `BertModel`.
inputs_ids集合的大小
hidden_size: Size of the encoder layers and the pooler layer.
编码层和池化层的大小
num_hidden_layers: Number of hidden layers in the Transformer encoder.
Transformer 编码器中隐藏层个数
num_attention_heads: Number of attention heads for each attention layer in
the Transformer encoder.
Transformer 编码器中每个注意层的头数
intermediate_size: The size of the "intermediate" (i.e., feed-forward)
layer in the Transformer encoder.
Transformer 编码器中中间层个数
hidden_act: The non-linear activation function (function or string) in the
encoder and pooler.
编码器和池化器的激活函数
hidden_dropout_prob: The dropout probability for all fully connected

BERT模型源码解析

经验总结扩展阅读

猫咪加菲猫生下一只独子，发现样子和自己长得不一样，猫：真的很忧伤

瘦男生冬天穿衣搭配这种风格超级时尚

知乎上有过一个高评话题：“什么样的人往往是最厉害的？”评论区点赞最多的是这样一条回答——...|安静的女人，其实都懂得和自己的灵魂对话

面部|美人计｜周迅是内娱狐系美女天花板吧！

砂锅粉怎么吃?

三个月后，王梅回来上班了，却没有想到经理把她叫到办公室

艾尔登法环白龙有剧情吗

百罗是什么意思百罗怎么理解

孕妈在孕期，4种食物每天坚持吃，也许能帮你长胎不长肉

菠萝|成年男女晚间运动姿势一览

顶级高仿手表品牌：究竟值得购买吗,天梭高仿手表怎么样

致青春电影结局是什么?

1991年6月出生的属羊人2019年婚姻运势如何？如何办理结婚证？

2022大暑过后几天凉 2022年大暑后要多少天不热

2023年1月10日剖腹产吉日一览表 2023年1月10日剖腹产好不好

2024年十月十八出生廖姓女孩名字怎么取生辰八字五行查询

怎么弄隐藏图标（右下角图标隐藏了怎么弄出来)

智能医学工程是坑吗是冷门专业吗

奶黄包怎么做

献给天下老人的一篇好文，读完心情顺畅！群里的老姐老哥也看看！