docs: add Google-style docstrings to dspy/datasets/dataloader.py by saivedant169 · Pull Request #9458 · stanfordnlp/dspy

saivedant169 · 2026-03-16T15:59:41Z

Resolves #9457
Part of #8926

Description

Adds comprehensive Google-style docstrings with Args, Returns, Raises, and Example sections to all 9 public APIs in DataLoader:

DataLoader class docstring
from_huggingface() — includes split behavior (dict vs flat list)
from_csv()
from_pandas()
from_json()
from_parquet()
from_rm()
sample()
train_test_split() — documents float vs int size semantics

Each docstring includes a runnable code example showing typical usage.

ruff check and ruff format pass clean.

Add comprehensive docstrings with Args, Returns, Raises, and Example sections to all 9 public APIs in DataLoader: - DataLoader class - from_huggingface, from_csv, from_pandas, from_json, from_parquet - from_rm, sample, train_test_split Resolves stanfordnlp#9457

MaximeRivest · 2026-03-16T17:42:04Z

before reviewing docstrings pr we now ask that you please add screenshots of you pr's changes as we would see them on dspy.ai after the pr is merged.

See #8926 (comment)

saivedant169 · 2026-03-16T18:26:05Z

@MaximeRivest I tried building the docs locally with mkdocs serve but the build crashes on a nbconvert / Python 3.14 incompatibility when processing Jupyter notebook pages (ValueError: No template sub-directory with name 'lab'). This is unrelated to my docstring changes — it fails before reaching the API reference pages.

I verified that all 9 docstrings parse correctly through griffe (the handler mkdocstrings uses). Here's the output from griffe.load('dspy.datasets.dataloader'):

DataLoader class:

Utility for loading datasets from various sources into DSPy Examples.

DataLoader provides methods to load data from Hugging Face Hub, CSV, JSON,
Parquet files, Pandas DataFrames, and retrieval modules, converting each row
into a dspy.Example with the specified input keys.

Methods documented (all 9/9):

from_huggingface() — Args, Returns, Raises, Example
from_csv() — Args, Returns, Example
from_pandas() — Args, Returns, Example
from_json() — Args, Returns, Example
from_parquet() — Args, Returns, Example
from_rm() — Args, Returns, Raises
sample() — Args, Returns, Raises, Example
train_test_split() — Args, Returns, Raises, Example

Every docstring follows Google style and includes runnable >>> examples. If someone with Python 3.12/3.13 can confirm the full mkdocs render, happy to add screenshots from their build.

MaximeRivest · 2026-03-16T18:33:21Z

please, push through. it does build. once you do see the docs locally, you will notice that you need to change some elements in you docstrings to respect formats.

see: #9445 and #9444 for example of format and change depth we expect.

saivedant169 · 2026-03-16T19:53:33Z

@MaximeRivest Here are the rendered docs screenshots from local mkdocs build:

MaximeRivest · 2026-03-18T18:37:05Z

Please fix the formatting before we can engage into reviewing the content.

This is one example of a formatting issue:

saivedant169 · 2026-03-18T19:06:52Z

saivedant169 · 2026-03-24T19:19:26Z

Hey @MaximeRivest, just checking in on this one. Let me know if anything needs changing or if you'd rather handle it differently.

MaximeRivest · 2026-03-27T16:52:52Z

hello @saivedant169, do you mind providing a proof that all your examples run?

saivedant169 · 2026-03-27T16:53:59Z

yes for sure

saivedant169 · 2026-03-27T17:18:57Z

saivedant169 · 2026-03-27T17:19:12Z

is this good @MaximeRivest ?

MaximeRivest · 2026-03-27T17:21:31Z

thank you! good job on running all those checks and tests. I will now review the text in the coming days.

saivedant169 mentioned this pull request Mar 17, 2026

docs: add docstrings to predict/refine.py and predict/best_of_n.py #9463

Open

saivedant169 added 2 commits March 18, 2026 14:47

fix docstring formatting for mkdocstrings rendering

ac992f8

switch examples to fenced code blocks for mkdocstrings

f8a196d

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs: add Google-style docstrings to dspy/datasets/dataloader.py#9458

docs: add Google-style docstrings to dspy/datasets/dataloader.py#9458
saivedant169 wants to merge 3 commits intostanfordnlp:mainfrom
saivedant169:docstrings-dataloader

saivedant169 commented Mar 16, 2026

Uh oh!

MaximeRivest commented Mar 16, 2026

Uh oh!

saivedant169 commented Mar 16, 2026

Uh oh!

MaximeRivest commented Mar 16, 2026 •

edited

Loading

Uh oh!

saivedant169 commented Mar 16, 2026

Uh oh!

MaximeRivest commented Mar 18, 2026

Uh oh!

saivedant169 commented Mar 18, 2026

Uh oh!

saivedant169 commented Mar 24, 2026

Uh oh!

MaximeRivest commented Mar 27, 2026

Uh oh!

saivedant169 commented Mar 27, 2026

Uh oh!

saivedant169 commented Mar 27, 2026

Uh oh!

saivedant169 commented Mar 27, 2026

Uh oh!

MaximeRivest commented Mar 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

saivedant169 commented Mar 16, 2026

Description

Uh oh!

MaximeRivest commented Mar 16, 2026

Uh oh!

saivedant169 commented Mar 16, 2026

Uh oh!

MaximeRivest commented Mar 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

saivedant169 commented Mar 16, 2026

Uh oh!

MaximeRivest commented Mar 18, 2026

Uh oh!

saivedant169 commented Mar 18, 2026

Uh oh!

saivedant169 commented Mar 24, 2026

Uh oh!

MaximeRivest commented Mar 27, 2026

Uh oh!

saivedant169 commented Mar 27, 2026

Uh oh!

saivedant169 commented Mar 27, 2026

Uh oh!

saivedant169 commented Mar 27, 2026

Uh oh!

MaximeRivest commented Mar 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

MaximeRivest commented Mar 16, 2026 •

edited

Loading