Evaluation for mailcom#118

Merged: kimlee87 merged 23 commits into main from eval on Apr 20, 2026
@kimlee87 (Contributor) commented Apr 10, 2026

Document evaluation strategy and related results

  • Qualitative evaluation
    • mailcom functionalities and misaligned cases
    • mailcom vs. other open-source pseudonymization tools
      • QualiAnon (based on its tutorial video)
      • Amnesia (based on its playground web page)
      • Presidio
      • Scrubadub
  • Quantitative evaluation (mailcom vs. Presidio)
    • email detection
    • NER
    • number detection
  • Summarize findings
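
A quantitative comparison like the one listed above (email detection, NER, number detection) is commonly scored with precision, recall, and F1 over detected entities. The following is a minimal sketch of that scoring, not the code from the mailcom notebooks: it assumes gold and predicted entities are exact-match `(start, end, label)` spans, and the function name `score_spans` and the sample spans are hypothetical.

```python
from typing import NamedTuple

# Hypothetical representation: (start offset, end offset, entity label).
Span = tuple[int, int, str]


class Scores(NamedTuple):
    precision: float
    recall: float
    f1: float


def score_spans(gold: set[Span], predicted: set[Span]) -> Scores:
    """Exact-match precision, recall, and F1 between two sets of spans."""
    true_positives = len(gold & predicted)
    # Guard against division by zero when either set is empty.
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    denominator = precision + recall
    f1 = 2 * precision * recall / denominator if denominator else 0.0
    return Scores(precision, recall, f1)


# Illustrative data only: two of three predictions match the gold spans.
gold = {(0, 5, "PER"), (20, 37, "EMAIL"), (50, 54, "NUM")}
predicted = {(0, 5, "PER"), (20, 37, "EMAIL"), (60, 63, "NUM")}
scores = score_spans(gold, predicted)
```

Exact span matching is the strictest choice; a partial-overlap criterion would give higher scores for tools that detect the right entity with slightly different boundaries.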

@codecov (bot) commented Apr 10, 2026

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 99.59%. Comparing base (faf6624) to head (4e8831d).
⚠️ Report is 24 commits behind head on main.

Additional details and impacted files
@@            Coverage Diff             @@
##             main     #118      +/-   ##
==========================================
- Coverage   99.63%   99.59%   -0.04%     
==========================================
  Files          13       13              
  Lines        2191     2494     +303     
==========================================
+ Hits         2183     2484     +301     
- Misses          8       10       +2     

Copilot AI (Contributor) left a comment

Pull request overview

Adds project documentation describing how mailcom was evaluated (qualitatively and quantitatively) and provides the notebooks/scripts that generated the reported results, so that the results can be reproduced.

Changes:

  • Add an “Evaluation” section to the root README linking to the evaluation write-up and notebooks.
  • Introduce evaluation_strategy.md describing qualitative comparisons and quantitative benchmark methodology/results.
  • Add two new notebooks for quantitative evaluation and for testing/comparing other tools.

Reviewed changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated 13 comments.

| File | Description |
| --- | --- |
| README.md | Adds a short entry point linking readers to evaluation docs and notebooks. |
| evaluation_strategy.md | Documents evaluation strategy, qualitative observations, quantitative setup, and summarized metrics. |
| docs/source/notebooks/quantitative_eval.ipynb | Notebook to generate quantitative evaluation datasets and compute metrics. |
| docs/source/notebooks/test_other_tools.ipynb | Notebook to run Presidio/Scrubadub comparisons and export results. |

Comment thread evaluation_strategy.md Outdated
Comment thread evaluation_strategy.md Outdated
Comment thread evaluation_strategy.md Outdated
Comment thread evaluation_strategy.md Outdated
Comment thread evaluation_strategy.md Outdated
Comment thread docs/source/notebooks/quantitative_eval.ipynb
Comment thread docs/source/notebooks/quantitative_eval.ipynb
Comment thread docs/source/notebooks/quantitative_eval.ipynb Outdated
Comment thread docs/source/notebooks/quantitative_eval.ipynb Outdated
Comment thread docs/source/notebooks/test_other_tools.ipynb
@kimlee87 kimlee87 marked this pull request as ready for review April 13, 2026 14:05
@kimlee87 kimlee87 requested a review from iulusoy April 13, 2026 14:05
@iulusoy (Member) left a comment

Thank you for all your work on this, it looks great!

@kimlee87 kimlee87 merged commit d95746c into main Apr 20, 2026
16 checks passed
@kimlee87 kimlee87 deleted the eval branch April 20, 2026 09:11