Python Machine Learning/Data Science Project Structure

Question

I'm looking for information on how a Python Machine Learning project should be organized. For regular Python projects there is Cookiecutter, and for R there is ProjectTemplate.

This is my current folder structure, but I'm mixing Jupyter Notebooks with actual Python code and it does not seem very clear.
├── cache
├── data
├── my_module
├── logs
├── notebooks
├── scripts
├── snippets
└── tools
I work in the scripts folder and am currently adding all the functions to files under my_module, but that leads to errors when loading data (relative/absolute paths) and other problems.

I could not find proper best practices or good examples on this topic besides this Kaggle competition solution and some Notebooks that have all the functions condensed at the start of the Notebook.

Dev · Answer

In response to your question about reusing code by importing .py files into notebooks, appending the source directory to the system path has proven the most successful method. This may make some people shudder, but it appears to be the cleanest way of importing code into a notebook without an editable install (pip install -e) and a lot of module boilerplate.

With the above, one tip is to use the %autoreload and %aimport magics. Here's an illustration:

# Load the "autoreload" extension
%load_ext autoreload

# always reload modules marked with "%aimport"
%autoreload 1

import os
import sys

# add the 'src' directory as one where we can import modules
# (this assumes the notebook's working directory is a sibling of 'src')
source_dir = os.path.join(os.getcwd(), os.pardir, 'src')
sys.path.append(source_dir)

# import my method from the source code; %aimport marks it for autoreload
%aimport preprocess.build_features
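
The snippet above depends on the notebook's current working directory, which is also where the relative/absolute path errors in the question tend to come from. As a rough sketch only, the same idea can be written with pathlib so that both imports and data paths are anchored to the project root. The helper names `find_project_root` and `add_src_to_path`, the `src` marker directory, and the data file name in the usage note are all assumptions made for illustration, not part of the original answer.

```python
import sys
from pathlib import Path


def find_project_root(start: Path, marker: str = "src") -> Path:
    """Walk upward from `start` until a directory containing `marker` is found.

    `marker` is a hypothetical convention: the folder that holds importable code.
    """
    start = start.resolve()
    for candidate in (start, *start.parents):
        if (candidate / marker).is_dir():
            return candidate
    raise FileNotFoundError(f"no parent of {start} contains a '{marker}' directory")


def add_src_to_path(start: Path, marker: str = "src") -> Path:
    """Append <project root>/<marker> to sys.path (once) and return the root."""
    root = find_project_root(start, marker)
    src = str(root / marker)
    if src not in sys.path:
        sys.path.append(src)
    return root
```

In a notebook this might be used as `root = add_src_to_path(Path.cwd())`, after which a data file could be addressed as `root / "data" / "raw.csv"` (a made-up file name) regardless of which folder the notebook was started from.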
