LLaVA-MORE: A Comparative Study of LLMs and Visual Backbones for Enhanced Visual Instruction Tuning
Facial-Emotion-Detection-SigLIP2 is an image classification vision-language encoder model fine-tuned from google/siglip2-base-patch16-224.
Augmented-Waste-Classifier-SigLIP2 is an image classification vision-language encoder model fine-tuned from google/siglip2-base-patch16-224.
SigLIP2 (google/siglip2-base-patch16-224) is the base vision-language encoder model that the fine-tuned classifiers in this topic build on.
Age-Classification-SigLIP2 is an image classification vision-language encoder model fine-tuned from google/siglip2-base-patch16-224 for a single-label classification task. It is designed to predict the age group of a person from an image using the SiglipForImageClassification architecture.
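
Most of the single-label classifiers in this topic share the same inference pattern: load the fine-tuned checkpoint with SiglipForImageClassification, preprocess the image, and take the softmax over the logits. A minimal sketch of that pattern follows; the checkpoint id and image path are placeholders, not the actual repository names.

```python
import torch
from PIL import Image
from transformers import AutoImageProcessor, SiglipForImageClassification

model_id = "your-namespace/Age-Classification-SigLIP2"  # placeholder checkpoint id
processor = AutoImageProcessor.from_pretrained(model_id)
model = SiglipForImageClassification.from_pretrained(model_id)
model.eval()

image = Image.open("person.jpg").convert("RGB")  # placeholder image path
inputs = processor(images=image, return_tensors="pt")

with torch.no_grad():
    logits = model(**inputs).logits

# Single-label task: softmax over the classes, keep the highest-scoring one.
probs = logits.softmax(dim=-1)[0]
predicted = model.config.id2label[int(probs.argmax())]
print(predicted, float(probs.max()))
```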
Human-Action-Recognition is an image classification vision-language encoder model fine-tuned from google/siglip2-base-patch16-224 for multi-class human action recognition. It uses the SiglipForImageClassification architecture to predict human activities from still images.
Anime-Classification-v1.0 is an image classification vision-language encoder model fine-tuned from google/siglip2-base-patch16-224 for a single-label classification task. It is designed to classify anime-related images using the SiglipForImageClassification architecture.
x-bot-profile-detection is a SigLIP2-based classification model designed to detect profile authenticity types on social media platforms (such as X/Twitter). It categorizes a profile image into four classes: bot, cyborg, real, or verified. Built on google/siglip2-base-patch16-224.
Watermark-Detection-SigLIP2 is a vision-language encoder model fine-tuned from google/siglip2-base-patch16-224 for binary image classification. It is trained to detect whether an image contains a watermark or not, using the SiglipForImageClassification architecture.
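
For quick binary checks such as watermark or face-mask detection, the high-level transformers pipeline can stand in for the manual processor-and-logits flow. The model id and image path below are placeholders.

```python
from transformers import pipeline

# Placeholder model id; point this at the binary SigLIP2 classifier you want to run.
detector = pipeline("image-classification", model="your-namespace/Watermark-Detection-SigLIP2")

# Returns a list of {"label": ..., "score": ...} dicts, highest score first.
print(detector("photo.jpg"))  # "photo.jpg" is a placeholder image path
```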
Classifies handwritten digits (0-9).
Fashion-Mnist-SigLIP2 is an image classification vision-language encoder model fine-tuned from google/siglip2-base-patch16-224 for a single-label classification task. It is designed to classify images into Fashion-MNIST categories using the SiglipForImageClassification architecture.
Fire-Detection-Siglip2 is an image classification vision-language encoder model fine-tuned from google/siglip2-base-patch16-224 for a single-label classification task. It is designed to detect fire, smoke, or normal conditions using the SiglipForImageClassification architecture.
Deepfake vs Real is a dataset designed for image classification, distinguishing between deepfake and real images.
Food-101-93M is a fine-tuned image classification model built on top of google/siglip2-base-patch16-224 using the SiglipForImageClassification architecture. It is trained to classify food images into one of 101 popular dishes, derived from the Food-101 dataset.
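
As a rough illustration of how such a classifier can be produced, the sketch below fine-tunes the SigLIP2 base checkpoint on Food-101 with the Hugging Face Trainer. The dataset handling follows the standard transformers image-classification recipe; hyperparameters and other settings are illustrative assumptions, not the recipe actually used for Food-101-93M.

```python
import torch
from datasets import load_dataset
from transformers import (AutoImageProcessor, AutoModelForImageClassification,
                          Trainer, TrainingArguments)

dataset = load_dataset("food101")          # columns: "image" (PIL), "label" (int)
labels = dataset["train"].features["label"].names

base = "google/siglip2-base-patch16-224"
processor = AutoImageProcessor.from_pretrained(base)
model = AutoModelForImageClassification.from_pretrained(
    base,
    num_labels=len(labels),                # new 101-way classification head
    id2label=dict(enumerate(labels)),
    label2id={name: i for i, name in enumerate(labels)},
)

def transform(batch):
    # Turn PIL images into the pixel_values tensors the model expects.
    pixel_values = processor(
        images=[img.convert("RGB") for img in batch["image"]],
        return_tensors="pt",
    )["pixel_values"]
    batch["pixel_values"] = list(pixel_values)
    return batch

dataset = dataset.with_transform(transform)

def collate(examples):
    return {
        "pixel_values": torch.stack([ex["pixel_values"] for ex in examples]),
        "labels": torch.tensor([ex["label"] for ex in examples]),
    }

args = TrainingArguments(
    output_dir="siglip2-food101",          # all settings here are illustrative
    per_device_train_batch_size=32,
    learning_rate=5e-5,
    num_train_epochs=3,
    remove_unused_columns=False,           # keep "image" available for the transform
)

trainer = Trainer(
    model=model,
    args=args,
    data_collator=collate,
    train_dataset=dataset["train"],
    eval_dataset=dataset["validation"],
)
trainer.train()
```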
IndoorOutdoorNet is an image classification vision-language encoder model fine-tuned from google/siglip2-base-patch16-224 for a single-label classification task. It is designed to classify images as either Indoor or Outdoor using the SiglipForImageClassification architecture.
Clipart-126-DomainNet is an image classification vision-language encoder model fine-tuned from google/siglip2-base-patch16-224 for a single-label classification task. It is designed to classify clipart images into 126 domain categories using the SiglipForImageClassification architecture.
Multilabel-GeoSceneNet is a vision-language encoder model fine-tuned from google/siglip2-base-patch16-224 for multi-label image classification. It is designed to recognize and label multiple geographic or environmental elements in a single image using the SiglipForImageClassification architecture.
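
Multi-label models differ from the single-label classifiers above only at the output: each label gets an independent sigmoid score rather than competing in a softmax. A minimal sketch, again with placeholder checkpoint id, image path, and threshold:

```python
import torch
from PIL import Image
from transformers import AutoImageProcessor, SiglipForImageClassification

model_id = "your-namespace/Multilabel-GeoSceneNet"  # placeholder checkpoint id
processor = AutoImageProcessor.from_pretrained(model_id)
model = SiglipForImageClassification.from_pretrained(model_id)
model.eval()

image = Image.open("scene.jpg").convert("RGB")  # placeholder image path
inputs = processor(images=image, return_tensors="pt")

with torch.no_grad():
    logits = model(**inputs).logits[0]

scores = torch.sigmoid(logits)  # independent per-label probabilities
predicted = [model.config.id2label[i] for i, s in enumerate(scores) if s > 0.5]
print(predicted)
```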
Rice-Leaf-Disease is an image classification model fine-tuned from google/siglip2-base-patch16-224 for detecting and categorizing diseases in rice leaves. It is built using the SiglipForImageClassification architecture and helps in early identification of plant diseases for better crop management.
Coral-Health is an image classification vision-language encoder model fine-tuned from google/siglip2-base-patch16-224 for a single-label classification task. It is designed to classify coral reef images into two health conditions using the SiglipForImageClassification architecture.
Face-Mask-Detection is a binary image classification model based on google/siglip2-base-patch16-224, trained to detect whether a person is wearing a face mask or not. This model can be used in public health monitoring, access control systems, and workplace compliance enforcement.