Generation length is limited to 2048 tokens.Qwen3 model accuracy is low

Hi Team,
I am using local-completions to evaluate my model deployed in local.
The generated response is limited to 2048 tokens,and not able to increase it throught model_args or gen_kwargs max_length=120000 also.
Did anyone face this issue?