
fix: Avoid deep copying in ModelContainer to resolve pickling errors #559


Open

wants to merge 11 commits into main

Conversation

laksi1999

Description

This pull request introduces a new model_objects parameter in the ModelContainer class, allowing users to pass pre-created model objects directly. The primary change eliminates the need to deep copy model_object, which previously caused "cannot pickle" errors for non-serializable objects, such as those from Driverless AI. Instead, the updated implementation checks for a model_objects list and, when one is provided, assigns a model instance from it, preserving compatibility with existing tools.
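The selection logic described above can be sketched as follows. This is a minimal sketch: the function name `get_model_instance` is an assumption for illustration, not the library's actual code; only the attribute names `model_object` and `model_objects` come from the PR description.

```python
import copy

def get_model_instance(model_param):
    # Sketch of the described logic (names assumed from this PR):
    # prefer a pre-created instance from model_objects; otherwise
    # fall back to deep copying the single model_object.
    pre_created = getattr(model_param, "model_objects", None)
    if pre_created:
        return pre_created.pop(0)
    return copy.deepcopy(model_param.model_object)
```

With pre-created instances available, no deep copy (and hence no pickling) is attempted; without them, behaviour falls back to the original deep-copy path.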

Motivation and Context

The original implementation relied on deep copying experiment objects, leading to serialization issues when handling non-picklable objects. This limitation prevented users from effectively integrating their models, particularly when using community-standard tools. By introducing the model_objects parameter, this update resolves the serialization issue while maintaining backward compatibility with the existing single model_object approach.

Type of Change

fix: A bug fix

How to Test

  1. Create a ModelContainer instance, passing the existing model_object parameter along with the new model_objects parameter set to a pre-created list of model objects.
  2. Create a use case object that utilizes the ModelContainer instance.
  3. Run fairness and feature importance analysis for the created use case object.
  4. Validate that no "cannot pickle" errors occur when interacting with Driverless AI objects.
  5. Verify that the original functionality remains intact when only using the single model_object without the model_objects parameter.

Checklist

Please check all the boxes that apply to this pull request using "x":

  • I have tested the changes locally and verified that they work as expected.
  • I have added or updated the necessary documentation (README, API docs, etc.).
  • I have added appropriate unit tests or functional tests for the changes made.
  • I have followed the project's coding conventions and style guidelines.
  • I have rebased my branch onto the latest commit of the main branch.
  • I have squashed or reorganized my commits into logical units.
  • I have added any necessary dependencies or packages to the project's build configuration.
  • I have performed a self-review of my own code.
  • I have read, understood and agree to the Developer Certificate of Origin below, which this project utilises.

Screenshots (if applicable)

N/A

Additional Notes

This change ensures that non-picklable objects can be utilized without requiring workarounds, improving compatibility with community tools and frameworks.

Developer Certificate of Origin
Version 1.1

Copyright (C) 2004, 2006 The Linux Foundation and its contributors.

Everyone is permitted to copy and distribute verbatim copies of this
license document, but changing it is not allowed.


Developer's Certificate of Origin 1.1

By making a contribution to this project, I certify that:

(a) The contribution was created in whole or in part by me and I
   have the right to submit it under the open source license
   indicated in the file; or

(b) The contribution is based upon previous work that, to the best
   of my knowledge, is covered under an appropriate open source
   license and I have the right under that license to submit that
   work with modifications, whether created in whole or in part
   by me, under the same open source license (unless I am
   permitted to submit under a different license), as indicated
   in the file; or

(c) The contribution was provided directly to me by some other
   person who certified (a), (b) or (c) and I have not modified
   it.

(d) I understand and agree that this project and the contribution
   are public and that a record of the contribution (including all
   personal information I submit with it, including my sign-off) is
   maintained indefinitely and may be redistributed consistent with
   this project or the open source license(s) involved.

@imda-benedictlee
Contributor

imda-benedictlee commented May 27, 2025

Hi @laksi1999 Thanks for the PR contribution. I will take a look at the PR together with the team.

@imda-benedictlee
Contributor

imda-benedictlee commented May 27, 2025

Hi @laksi1999, can I check if you have sample non-serialisable objects to provide for testing this PR? Also, would like to check if this is run via the Notebook and CLI Standalone Plugin?

@imda-benedictlee
Contributor

Unit Test for Veritas Passed
unit-test.txt

Veritas CLI Ran Successfully
cli-test.txt

Veritas CLI Docker Ran Successfully
cli-test-docker.txt

Veritas Example Notebooks Ran Successfully
CM_output.zip
PUW_output.zip
CS_output.zip
BaseRegression_output.zip
BaseClassification_output.zip

@laksi1999
Author

@imda-benedictlee Thanks for reviewing the PR.

I’ve attached a notebook that reproduces the error encountered when calling feature_importance() from the aiverify_veritastool:
TypeError: cannot pickle 'daimojo.cppmojo.model' object.
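For readers without a Driverless AI environment, this class of error can be reproduced with any object holding a non-serializable handle. Below, a threading.Lock is a hypothetical stand-in for the native MOJO handle; it is not the actual daimojo object.

```python
import copy
import threading

class UnpicklableModel:
    """Stand-in for a model that holds a non-serializable native
    handle; a lock, like a native runtime object, cannot be pickled."""
    def __init__(self):
        self._handle = threading.Lock()

try:
    # deepcopy falls back to pickle-style reduction for unknown
    # objects, which fails on the lock held inside the model.
    copy.deepcopy(UnpicklableModel())
except TypeError as exc:
    print(exc)  # e.g. "cannot pickle '_thread.lock' object"
```

Any code path that deep copies such a model, as the current feature_importance() implementation does, hits the same TypeError.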

In this notebook, I used a MOJO model. I’ve also included the corresponding model file and the datasets used. Please see the attached Veritas_artifacts.zip

However, since you won’t be able to test the MOJO model in your environment due to authentication restrictions, I’ve also attached a scikit-learn model for testing purposes. Along with that, I’ve included two notebooks:

  • The first shows how the feature_importance() function fails with the current implementation due to a deep copy issue.
  • The second shows how this PR resolves that issue using a modified version of the aiverify_veritastool.

Please see the attached Veritas_artifacts_scikit_learn.zip for the artifacts.

@imda-benedictlee
Contributor

Hi @laksi1999, thanks for the help on this. I will take a look and review the PR.

@timlrx
Collaborator


The leave one out calculation in the feature importance method uses a ThreadPoolExecutor to run multiple threads concurrently, each analysing a different protected variable. Without deep copying there might be race conditions.

I get that allowing a model_objects field would bypass the serialization but it introduces other issues:

  • How do we know the number of model_objects to pass?
  • From the second thread onwards, each thread gets a different model_object or falls back to deepcopy
  • There might also be a race condition with multiple threads calling pop on the same list.

Open to suggestions on how to avoid serialization, but we need to create multiple identical copies for parallel processing, not manage a pool of different model objects.
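The exhaustion concern above can be illustrated with a minimal sketch (all names here are illustrative, not from the PR): when more worker threads than pre-created models pop from one shared list, later workers come up empty.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical shared pool: fewer pre-created models than worker threads.
shared_models = ["model_a", "model_b"]

def take_model(_):
    # Every worker pops from the same list; once it is exhausted,
    # the remaining workers have no model to use.
    try:
        return shared_models.pop(0)
    except IndexError:
        return None

with ThreadPoolExecutor(max_workers=4) as executor:
    results = list(executor.map(take_model, range(4)))
# Exactly two workers obtain a model; the other two get None.
```

Since the thread count is an internal detail of the feature importance method, the caller cannot reliably size the list in advance.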

Comment on lines +4203 to +4205
model_obj = getattr(model_param, "model_object", None)
if hasattr(model_param, 'model_objects') and model_param.model_objects:
model_object = model_param.model_objects.pop(0)
Collaborator


What is passed in model_objects? How do we ensure that there is a sufficient number of model_objects for the threads to access?

@laksi1999
Author

@timlrx Based on your feedback, I propose using a lightweight ModelObjectsWrapper, which provides a pop() method that loads a new model instance from disk on demand.

from abc import ABC, abstractmethod
from typing import Any

class ModelObjectsWrapper(ABC):
    @abstractmethod
    def pop(self, index) -> Any:
        """Return a new model object instance."""
        pass

For example, we can use it as follows:

import joblib
from typing import Any

class MyModelObjectsWrapper(ModelObjectsWrapper):
    def __init__(self, path: str):
        self._path = path

    def pop(self, index) -> Any:
        # Load a fresh, independent model instance from disk on each call
        return joblib.load(self._path)

model_objects = MyModelObjectsWrapper("/path/to/model")

Why This Approach Solves the Problem:

  • No need to predefine the number of models: Each thread gets a fresh model by calling pop(), so thread count doesn’t matter.
  • No shared list = no race condition: There's no shared mutable state; each call is independent.
  • Avoids deepcopy fallback: Models are loaded cleanly, avoiding serialization issues.
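The points above can be demonstrated with a self-contained sketch of the same pattern. BytesModelObjectsWrapper is hypothetical: serialized bytes stand in for a model file, where a real implementation would reload from a path (e.g. with joblib.load as shown earlier).

```python
import pickle
from concurrent.futures import ThreadPoolExecutor

class BytesModelObjectsWrapper:
    """Illustrative wrapper following the proposed interface: pop()
    rebuilds a fresh model on every call instead of draining a shared
    list, so thread count does not matter and no state is shared."""
    def __init__(self, blob: bytes):
        self._blob = blob

    def pop(self, index=None):
        # A new, independent instance per call: no shared mutable state.
        return pickle.loads(self._blob)

template_model = {"coef": [1.0, 2.0]}  # stand-in for a trained model
wrapper = BytesModelObjectsWrapper(pickle.dumps(template_model))

# Each of the three workers receives its own identical copy.
with ThreadPoolExecutor(max_workers=3) as executor:
    models = list(executor.map(lambda _: wrapper.pop(), range(3)))
```

Every call yields an equal but distinct object, which is exactly the "multiple identical copies for parallel processing" requirement, without deep copying the original.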

I’ve attached the notebook with the current solution. veritas_with_iris_dataset_with_success.zip

The pop method takes an index because the earlier implementation relied on one. If the current proposal makes sense, I can simplify the code by using pop() without arguments and removing the index logic.

Please let me know your thoughts on this.
