论文标题

Pynblint:Python Jupyter笔记本的静态分析仪

Pynblint: a Static Analyzer for Python Jupyter Notebooks

论文作者

Quaranta, Luigi, Calefato, Fabio, Lanubile, Filippo

论文摘要

Jupyter Notebook是ML工作流程的早期阶段的许多数据科学家选择的工具。但是,笔记本格式因引起不良编程实践而受到批评。实际上,研究人员已经表明,开源存储库被质量不佳的笔记本淹没。 ML工作流程原型阶段的低质量输出构成了针对ML模型生产的明确瓶颈。为了促进更好的笔记本的创建,我们开发了Pynblint,这是用Python编写的Jupyter笔记本的静态分析器。该工具通过一组经验验证的最佳实践来检查笔记本电脑(和周围存储库)的合规性,并在检测到违规时提供有针对性的建议。

Jupyter Notebook is the tool of choice of many data scientists in the early stages of ML workflows. The notebook format, however, has been criticized for inducing bad programming practices; indeed, researchers have already shown that open-source repositories are inundated by poor-quality notebooks. Low-quality output from the prototypical stages of ML workflows constitutes a clear bottleneck towards the productization of ML models. To foster the creation of better notebooks, we developed Pynblint, a static analyzer for Jupyter notebooks written in Python. The tool checks the compliance of notebooks (and surrounding repositories) with a set of empirically validated best practices and provides targeted recommendations when violations are detected.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源