Skip to content
View PRITHIVSAKTHIUR's full-sized avatar
🎯
Focusing
🎯
Focusing

Organizations

@Stranger-Zone @Stranger-Guard

Block or report PRITHIVSAKTHIUR

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
PRITHIVSAKTHIUR/README.md

Hey 👋 What's up?

hi, i am prithiv!

i am a graduate engineer [ug 2024], information technology, gcee
focused on working in llm training and enhancements, improving multimodal ai capabilities.
GitHub Stats & Activity
GitHub Streak GitHub Stats
Activity Graph

🔬 Experimental repositories

DREX Flux-LoRA-DLC Camel-Doc-OCR
GitHub GitHub GitHub
Model Models Model
Document Retrieval and Extraction eXpert model specialized for content extraction and analysis. Built on Qwen2.5-VL architecture. FLUX.1-dev diffusion model with 255+ community LoRAs collection. Easy-to-use Gradio interface for diverse artistic styles. Fine-tuned Qwen2.5-VL-7B model for document comprehension, retrieval and content extraction capabilities.
Watermark-Detection-SigLIP2 Facial-Emotion-Detection-SigLIP2 FineTuning-SigLIP-2
GitHub GitHub GitHub
Model Model Blog
Vision-language model fine-tuned from SigLIP2 for binary watermark detection in images using advanced classification architecture. Image classification model fine-tuned from SigLIP2 for facial emotion recognition with single-label classification capabilities. Comprehensive guide for fine-tuning SigLIP 2 models for single/multi-label image classification tasks with practical examples.

🐢Progressing slowly, like a tortoise.

Stranger Zone Stranger Guard
GitHub GitHub
HuggingFace HuggingFace
Building illustration adapters for diffusion models, The Stranger Zone specializes in intelligence development, focusing on fine-tuning models for computer vision ; text-to-image specialized adapters (LoRA). Stranger Guard specializes in building strict content moderation models, with a core focus on advanced computer vision tasks. Our team develops precision-driven AI systems capable of detecting, classifying, and moderating visual content at scale.

Pinned Loading

  1. Doc-VLMs-v2-Localization Doc-VLMs-v2-Localization Public

    Doc-VLMs-v2-Localization is a demo app for the Camel-Doc-OCR-062825 model, fine-tuned from Qwen2.5-VL-7B-Instruct for advanced document retrieval, extraction, and analysis. It enhances document und…

    Python 1

  2. FineTuning-SigLIP-2 FineTuning-SigLIP-2 Public

    Fine-Tuning SigLIP 2 for Single/Multi-Label Image Classification. Image classification vision-language encoder model fine-tuned for Image Classification Tasks

    Jupyter Notebook 23 2

  3. Qwen2.5-VL-Video-Understanding Qwen2.5-VL-Video-Understanding Public

    The Qwen2.5-VL-7B-Instruct model is a multimodal AI model developed by Alibaba Cloud that excels at understanding both text and images. It's a Vision-Language Model (VLM) designed to handle various…

    Python 1

  4. OCR-ReportLab OCR-ReportLab Public

    A dedicated Colab notebooks to experiment (Nanonets OCR, Monkey OCR, OCRFlux 3B, Typhoo OCR 3B) On T4 GPU - free tier

    Jupyter Notebook 3 1

  5. DREX DREX Public

    drex-062225-exp (document retrieval and extraction expert) model is a specialized fine-tuned version of docscopeocr-7b-050425-exp, optimized for document retrieval, content extraction, and analysis…

    Python 1

  6. Flux-LoRA-DLC Flux-LoRA-DLC Public

    Experience the power of the FLUX.1-dev diffusion model combined with a massive collection of 255+ community-created LoRAs! This Gradio application provides an easy-to-use interface to explore diver…

    Python 10 1