资源 | 2017年GitHub中最为流行的30个开源机器学习项目

openlog 2019-09-10

展开全文

　　机器之心编译

　　参与：李泽南

　　2017 年里哪些机器学习项目最受人关注？Mybridge 为我们整理了一份 Top 30 列表，以下所有项目均附有 GitHub 链接。

　　我们对比了近 8800 个开原机器学习项目，并挑选了其中最好的 30 个列举于此。这是一个非常具有竞争力的列表，其中包含 2017 年 1 月-12 月份开源的各类优秀机器学习库、数据集和应用。Mybridge AI 通过流行度、参与度和新鲜程度来对它们进行评级。先给你一个直观印象：它们的 GitHub 平均 stars 是 3558 个。

　　开源项目对于数据科学家而言非常有意义，我们可以通过阅读源代码，在前人的基础上构建更加强大的项目。现在，你可以尽情尝试一下这些去年最佳项目了。

　　No.1

　　FastText：快速文本表示/分类库，来自 Facebook（GitHub 11,786 stars）

　　链接：https://github.com/facebookresearch/fastText

　　参考内容：Facebook发布新版fastText：拓展至移动端，加入教程

　　又及 Muse：多语言无监督/监督词嵌入，基于 FastText（GitHub 695 stars）

　　链接：https://github.com/facebookresearch/MUSE

　　No.2

　　Deep-photo-styletransfer：康奈尔大学 Fujun Luan 论文《Deep Photo Style Transfer》的代码与数据（GitHub 9747 stars）

　　链接：https://github.com/luanfujun/deep-photo-styletransfer

　　No.3

　　face recognition：最简单的 Python 命令行面部识别 API，来自 Adam Geitgey（GitHub 8672 stars）

　　链接：https://github.com/ageitgey/face_recognition

　　参考内容：基于Python的开源人脸识别库：离线识别率高达99.38%

　　No.4

　　Magenta：机器智能音乐与艺术生成器（GitHub 8113 stars）

　　链接：https://github.com/tensorflow/magenta

　　参考内容：谷歌Magenta项目是如何教神经网络编写音乐的？

　　No.5

　　Sonnet：基于 TensorFlow 的神经网络库（GitHub 5731 stars），来自 DeepMind 成员 Malcolm Reynolds

　　链接：https://github.com/deepmind/sonnet

　　参考内容：DeepMind开源Sonnet：可在TensorFlow中快速构建神经网络

　　No.6

　　deeplearn.js：来自 Google Brain 团队 Nikhil Thorat 的网页端硬件加速机器学习库（GitHub 5462 stars）

　　链接：https://github.com/PAIR-code/deeplearnjs

　　参考内容：谷歌开源DeepLearn.js：可在网页上实现硬件加速的机器学习

　　No.7

　　Fast Style Transfer：TensorFlow 快速风格转换，来自 MIT 的 Logan Engstrom（GitHub 4843 stars）

　　链接：https://github.com/lengstrom/fast-style-transfer

　　No.8

　　Pysc2：星际争霸 2 学习环境，来自 DeepMind Timo Ewalds 等人（GitHub 3683 stars）

　　链接：https://github.com/deepmind/pysc2

　　No.9

　　AirSim：基于虚幻引擎的开源自动驾驶模拟器，由微软研究院 Shital Shah 等人提出（GitHub 3861 stars）

　　链接：https://github.com/Microsoft/AirSim

　　No.10

　　Facets：机器学习数据集可视化工具，来自 Google Brain（GitHub 3371 stars）

　　链接：https://github.com/PAIR-code/facets

　　参考内容：谷歌开源机器学习可视化工具 Facets：从全新角度观察数据

　　No.11

　　Style2Paints：AI 漫画线稿上色工具，来自苏州大学（GitHub 3310 stars）

　　链接：https://github.com/lllyasviel/style2paints

　　参考内容：Style2paints：专业的AI漫画线稿自动上色工具

　　No.12

　　Tensor2Tensor：用于广义序列-序列模型的工具库，来自 Google Brain 的 Ryan Sepassi（GitHub 3087 stars）

　　链接：https://github.com/tensorflow/tensor2tensor

　　参考内容：一个模型库学习所有：谷歌开源模块化深度学习系统Tensor2Tensor

　　No.13

　　CycleGAN and pix2pix in PyTorch：基于 PyTorch 的图像-图像转换工具，来自 UC Berkeley 在读博士朱俊彦（GitHub 2847 stars）

　　链接：https://github.com/junyanz/pytorch-CycleGAN-and-pix2pix

　　参考内容：你来手绘涂鸦，人工智能生成「猫片」：edges2cats图像转换详解

　　No.14

　　Faiss：用密集向量高效相似性搜索与聚类的工具库，来自 Facebook（GitHub 2629 stars）

　　链接：https://github.com/facebookresearch/faiss

　　No.15

　　Fashion-mnist：一个类似于 MNIST 的时尚产品数据集，来自 Zalando Tech 的 Han Xiao（GitHub 2780 stars）

　　链接：https://github.com/zalandoresearch/fashion-mnist

　　No. 16

　　ParlAI：用于在各种公开可用的对话数据集上训练与评估 AI 模型的框架，来自 Facebook 的 Alexander Miller（GitHub 2578 stars）

　　链接：https://github.com/facebookresearch/ParlAI

　　参考内容：Facebook开源人工智能框架ParlAI：可轻松训练评估对话模型

　　No.17

　　Fairseq：来自 FAIR 的序列到序列工具包（GitHub 2571 stars）

　　链接：https://github.com/facebookresearch/fairseq

　　参考内容：Facebook提出全新CNN机器翻译：准确度超越谷歌而且还快九倍（已开源）

　　No.18

　　Pyro：使用 Python 和 PyTorch 进行深度通用概率编程，来自 Uber AI Labs（GitHub 2387 stars）

　　链接：https://github.com/uber/pyro

　　参考内容：Uber与斯坦福大学开源深度概率编程语言Pyro：基于PyTorch

　　No.19

　　iGAN：基于 GAN 的交互图像生成器（GitHub 2369 stars）

　　链接：https://github.com/junyanz/iGAN

　　参考内容：伯克利大学和Adobe开源深度学习图像编辑工具 iGAN

　　No.20

　　Deep-image-prior：使用神经网络进行图像恢复，同时无需学习过程，来自 Skoltech 的 Dmitry Ulyanov（GitHub 2188 stars）

　　链接：https://github.com/DmitryUlyanov/deep-image-prior

　　No.21

　　Face classification：基于 Keras CNN 模型与 OpenCV 的实时面部检测和表情/性别分类，训练与 fer2013/imdb 数据集（GitHub 1967 stars）

　　链接：https://github.com/oarriaga/face_classification

　　No.22

　　Speech to Text WaveNet：使用 DeepMind 的 WaveNet 和 TensorFlow 构成的端到端句级英语语音识别，来自 Kakao Brain 的 Namju Kim（GitHub 1961 stars）

　　链接：https://github.com/buriburisuri/speech-to-text-wavenet

　　参考内容：DeepMind WaveNet，将机器合成语音水平与人类差距缩小50%

　　No.23

　　StarGAN：用于多领域图像-图像转换的统一生成对抗网络（GitHub 1954 stars）

　　链接：https://github.com/yunjey/StarGAN

　　No.24

　　MI-agents：Unity 机器学习智能体，来自 Unity3D 的 Arthur Juliani（GitHub 1658 stars）

　　链接：https://github.com/Unity-Technologies/ml-agents

　　No.25

　　Deep Video Analytics：分布式可视化搜索和可视化数据分析平台，来自康奈尔大学的 Akshay Bhat（GitHub 1494 stars）

　　链接：https://github.com/AKSHAYUBHAT/DeepVideoAnalytics

　　No.26

　　OpenNMT：Torch 上的开源神经机器翻译（GitHub 1490 stars）

　　链接：https://github.com/OpenNMT/OpenNMT

　　参考内容：哈佛大学NLP组开源神经机器翻译工具包OpenNMT：已达到生产可用水平

　　No.27

　　Pix2PixHD：使用条件 GAN 合成和处理 2048×1024 分辨率的图像，来自英伟达 AI 科学家 Ming-Yu Liu（GitHub 1283 stars）

　　链接：https://github.com/NVIDIA/pix2pixHD

　　No.28

　　Horovod：分布式 TensorFlow 训练框架，来自 Uber 工程团队（GitHub 1188 stars）

　　链接：https://github.com/uber/horovod

　　参考内容：详解Horovod：Uber开源的TensorFlow分布式深度学习框架

　　No.29

　　AI-Blocks：强大而直观的 WYSIWYG 界面，可让任何人创建机器学习模型（GitHub 899 stars）

　　链接：https://github.com/MrNothing/AI-Blocks

　　No.30

　　Voice Conversion with Non-Parallel Data：基于 TensorFlow 的深度神经网络语音转换（语音风格转换），来自 Kakao Brain 团队的 Dabi Ahn（GitHub 845 stars）

本站是提供个人知识管理的网络存储空间，所有内容均由用户发布，不代表本站观点。请注意甄别内容中的联系方式、诱导购买等信息，谨防诈骗。如发现有害或侵权内容，请点击一键举报。

转藏分享

QQ空间 QQ好友新浪微博微信

献花（0） +1

来自： openlog > 《人工智能》

举报/认领

0条评论

发表

请遵守用户评论公约

类似文章 更多

openlog

关注对话

TA的最新馆藏

The Generative AI Stack: Making the Future Happen Faster
Emerging Architectures for Modern Data Infrastructure
投资攀升，2021上半年美国教育科技投资增长300%
AnalyticDB实现和特点浅析
安防视频上云的趋势、痛点及华为云视频接入服务探索及实践
盘点 15 个好用的 API 接口管理神器

喜欢该文的人也喜欢更多

热门阅读换一换