nepp's blog

1年1月1日

RNN/LSTM、Encoder-Decoder 和 Attention RNN：循环神经网络 LSTM：long short-term memory 语言模型（ language model）定义了自然语言中标记序列的概率分布。 Embedding几乎是在 NLP 任务使用深度学习的标准步骤。我们可以通过 Word2Vec、GloVe 等从未标注数据无监督的学习到词的 Embedding，然后把它用到不同的特定任务中。这种方法得到的 Embedding 叫作预训练的 (pretrained)Embedding。如果特定任务训练数据较多，那么我们可以用预训练的 Embedding 来初始化模型的 Embedding，然后用特定任务的监督数据来 fine-tuning。如果监督数据较少，我们可以固定 (fix)Embedding，只让模型学习其它的参数。这也可以看成一种 Transfer Learning。 Transformer BERT stands for “Bidirectional Encoder Representation with Transformers”. To put it in simple words BERT extracts patterns or representations from the data or word embeddings by passing it through an encoder. The encoder itself is a transformer architecture that is stacked together. It is a bidirectional transformer which means that during training it considers the context from both left and right of the vocabulary to extract patterns or representations. BERT uses two training paradigms: Pre-training and Fine-tuning. During pre-training, the model is trained on a large dataset to extract patterns. This is generally an unsupervised learning task where the model is trained on an unlabelled dataset like the data from a big corpus like Wikipedia.……

阅读全文

1年1月1日

title: “UNIX环境高级编程” date: 2021-05-03T21:12:44+08:00 draft: false categories: [“数据结构与算法”] tags: [“数据结构与算法”] 第15章进程间通信 ![image-20210512113039490](/Users/anpuqiang/Library/Application Support/typora-user-images/image-20210512113039490.png)……

阅读全文

1年1月1日

Ai push scheduler分时区发送增加俄罗斯在us-en基础上增加ru-ru 本地落盘缓存延迟发送的token 原有实现将延迟发送的token和延迟时间保存在内存中sleep 程序重启后内存中数据会丢失，增加落盘落盘实现分文件保存，文件名称country_language_send_time(send_time加个随机，防止重复) delayWriteAIMsgsToKafka中函数开始写文件，若成功发送后删除缓存文件缓存文件内格式- json 重启后，获取local-time，遍历缓存文件，计算sleep-time 图片服务错误图片替换复用upload页面后台增加cdn删除接口的调用搜索近似图的过程展示 expresswallet://user/kyc/result?id=xxx (根据id来查询kyc结果) expresswallet://user/complaint/detail?id=xxx (根据id来查询complaint结果) expresswallet://trans/history/detail?id=xxx&type=订单类型(id为订单id，后端定义的订单类型，买、卖、转账等) expresswallet://home expresswallet://invest expresswallet://earn expresswallet://profile push后台mysql字段……

阅读全文

1年1月1日

title: “面试” date: 2021-05-03T21:12:44+08:00 draft: false categories: [“面试”] tags: [“面试”] 自我介绍考察表达能力，面试准备情况 5 校内情况讨论项目、竞赛、论文方面硕士期间研究方向考察对计算机的学习热情、自主学习能力 4 计算机基础基于简历，从操作系统、网络、编译原理和体系结构里出2-3个问题进程操作系统运行程序大概流程必须做的第一件事是将代码和所有静态数据（例如初始化变量）加载（load）到内存中，加载到进程的地址空间中。堆/栈区别线程共享地址空间字节序数据在内存中的存储顺序大端/小端 tcp/udp 传输层区别 tcp三次握手/四次挥手 I/O多路复用，select poll epoll 考察专业基础是否扎实 grpc 语言基础 c/c++为主 HashMap底层结构哈希表和红黑树 vector 和数组区别 volatile cpu多级缓存和内存屏障 memcpy和strcpy的差别函数重载 this指针结合编译和运行时内存对齐的原理与意义内存泄露 https://www.cnblogs.com/skynet/archive/2011/02/20/1959162.html stack overflow 代码题目排序算法，归并排序，快排二叉树层次遍历，z型打印 LRU/堆判断链表环判断是否为二叉搜索树二叉树中序遍历根据前序遍历，中序遍历结果还原二叉树判断平衡二叉树求二叉树深度二叉搜索树，添加/删除节点链表中点 /**……

阅读全文

最近文章

福利派送

分类

标签

友情链接

其它