Sentence Embeddings with Hugging Face Sentence Transformers
Author: Venkata Sudhakar
Sentence embeddings convert text into dense numerical vectors that capture semantic meaning. Two sentences with similar meaning will have embeddings that are close together in vector space, even if the exact words differ. The Hugging Face sentence-transformers library provides easy access to models such as all-MiniLM-L6-v2, which produces high-quality 384-dimensional sentence embeddings. At ShopMax India, embeddings power the semantic product search engine, which returns relevant results even when customers use different words than those in the product catalog.

Cosine similarity is the standard metric for comparing embeddings. It measures the cosine of the angle between two vectors and returns a value between -1 and 1; a value close to 1 means the sentences are semantically very similar. The sentence-transformers library includes utility functions for computing cosine similarity directly. The example below shows how to generate sentence embeddings and find the most similar product to a customer query.
The example produces the following output:
Query: I need a good laptop for office work
Best match: ShopMax ProBook laptop with i7 processor and 16GB RAM
(score: 0.712)
Query: looking for wireless headphones
Best match: ShopMax TurboCharge wireless noise-cancelling earbuds
(score: 0.681)
Query: fitness tracker that monitors health
Best match: ShopMax SmartWatch with heart rate monitor and GPS
(score: 0.743)
The all-MiniLM-L6-v2 model is only 80MB in size and runs efficiently on CPU, making it suitable for deployment on standard web servers without GPU hardware. For production at ShopMax India, product embeddings should be pre-computed and stored in a vector database so that search queries only require encoding the query text at runtime, not the full catalog on every request.