[Feat] Add streaming support to Codestral Mamba #212

Open
@xNul

Description

Python -VV

N/A

Pip Freeze

N/A

Reproduction Steps

N/A

Expected Behavior

N/A

Additional Context

N/A

Suggested Solutions

mamba-ssm has added streaming support for the Mamba architecture that Codestral Mamba uses. It is exposed through the streamer parameter, which can be passed as an argument to this function call:

output = model.model.generate(
With that in place, implementing streaming support for Codestral Mamba in mistral-inference should be fairly straightforward.
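
To illustrate the idea, here is a minimal sketch of how a streamer object could be consumed on the caller's side. It assumes the streamer follows the Hugging Face style interface (a put() method called with new token ids and an end() method called when generation finishes). TokenQueueStreamer, fake_generate, and stream_tokens are hypothetical names used only for this example. fake_generate is a stand-in for the real generate call and works on plain Python lists rather than tensors, so this is not the mistral-inference or mamba-ssm implementation.

```python
import queue
import threading
from typing import Iterator, List, Optional


class TokenQueueStreamer:
    """Hypothetical streamer exposing put()/end(), backed by a thread-safe queue."""

    _END = object()  # sentinel marking the end of generation

    def __init__(self) -> None:
        self._queue: "queue.Queue[object]" = queue.Queue()

    def put(self, token_ids: List[int]) -> None:
        # Called by the generation loop each time new token ids are available.
        self._queue.put(list(token_ids))

    def end(self) -> None:
        # Called once by the generation loop when it is done.
        self._queue.put(self._END)

    def __iter__(self) -> Iterator[List[int]]:
        # Block until the next chunk arrives; stop at the sentinel.
        while True:
            item = self._queue.get()
            if item is self._END:
                return
            yield item  # type: ignore[misc]


def fake_generate(
    prompt_ids: List[int],
    max_new_tokens: int,
    streamer: Optional[TokenQueueStreamer] = None,
) -> List[int]:
    """Stand-in for the real generate call (assumption): emits dummy token ids."""
    output = list(prompt_ids)
    for i in range(max_new_tokens):
        next_id = len(prompt_ids) + i  # dummy "sampled" token
        output.append(next_id)
        if streamer is not None:
            streamer.put([next_id])
    if streamer is not None:
        streamer.end()
    return output


def stream_tokens(prompt_ids: List[int], max_new_tokens: int) -> Iterator[List[int]]:
    """Run generation in a background thread and yield token chunks as they arrive."""
    streamer = TokenQueueStreamer()

    def worker() -> None:
        try:
            fake_generate(prompt_ids, max_new_tokens, streamer=streamer)
        except Exception:
            # Make sure the consumer is not left waiting if generation fails.
            streamer.end()
            raise

    thread = threading.Thread(target=worker, daemon=True)
    thread.start()
    yield from streamer
    thread.join()


if __name__ == "__main__":
    for chunk in stream_tokens([1, 2, 3], max_new_tokens=3):
        print(chunk)
```

Running generation in a background thread lets the caller decode and print tokens as they arrive instead of waiting for the full output. In a real integration, the dummy loop would be replaced by the actual generate call with the streamer passed through, and each chunk would be decoded with the tokenizer.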

    Labels

    bug (Something isn't working)