You are tasked with finding the closest matching sentences across books, where each book has multiple paragraphs and sentences. Which SQL structure should you use?
What is the advantage of using Euclidean Squared Distance rather than Euclidean Distance in similarity search queries?
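As a minimal sketch of the idea behind this question (plain Python, not Oracle SQL; the vectors and helper names below are made up for illustration): the square root is monotonic, so ranking candidates by squared distance gives the same order as ranking by true Euclidean distance while skipping the square-root computation.

```python
import math

def squared_euclidean(a, b):
    # Sum of squared component differences; no square root taken.
    return sum((x - y) ** 2 for x, y in zip(a, b))

def euclidean(a, b):
    # True Euclidean distance: square root of the squared distance.
    return math.sqrt(squared_euclidean(a, b))

# Hypothetical query vector and candidates, for illustration only.
query = [1.0, 2.0]
candidates = {"a": [1.0, 3.0], "b": [4.0, 6.0], "c": [0.0, 0.0]}

rank_sq = sorted(candidates, key=lambda k: squared_euclidean(query, candidates[k]))
rank_eu = sorted(candidates, key=lambda k: euclidean(query, candidates[k]))

print(rank_sq)  # same order as rank_eu
```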
You need to generate a vector from the string '[1.2, 3.4]' in FLOAT32 format with 2 dimensions. Which function will you use?
In Oracle Database 23ai, which SQL function calculates the distance between two vectors using the Euclidean metric?
Which statement best describes the capability of Oracle Data Pump for handling vector data in the context of vector search applications?
You are asked to fetch the top five vectors nearest to a query vector, but only for a specific category of documents. Which query structure should you use?
A machine learning team is using IVF indexes in Oracle Database 23ai to find similar images in a large dataset. During testing, they observe that the search results are often incomplete, missing relevant images. They suspect the issue lies in the number of partitions probed. How should they improve the search accuracy?
What is the primary function of an embedding model in the context of vector search?
In the following Python code, what is the significance of prepending the source filename to each text chunk before storing it in the vector database?
docs = [{"text": filename + "|" + section, "path": filename} for filename, sections in faqs.items() for section in sections]
# Sample the resulting data
docs[:2]
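The snippet above depends on a `faqs` mapping that is not shown. A self-contained sketch, assuming `faqs` maps each filename to a list of text sections (the filenames and sections below are hypothetical), makes the shape of the resulting records visible:

```python
# Hypothetical input: filename -> list of text sections (an assumption,
# since the original snippet does not show how faqs is built).
faqs = {
    "billing.txt": ["How do I pay?", "Where is my invoice?"],
    "login.txt": ["How do I reset my password?"],
}

# Same comprehension as in the question: each chunk carries its source
# filename as a prefix, and the filename is also kept in "path".
docs = [
    {"text": filename + "|" + section, "path": filename}
    for filename, sections in faqs.items()
    for section in sections
]

for doc in docs[:2]:
    print(doc)
```

Each stored chunk then carries its source filename both inside the text and in the separate `path` field.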
An application needs to fetch the top-3 matching sentences from a dataset of books while ensuring a balance between speed and accuracy. Which query structure should you use?
What happens when you attempt to insert a vector with an incorrect number of dimensions into a VECTOR column with a defined number of dimensions?
Which is NOT a feature or capability related to AI and Vector Search in Exadata?
A database administrator wants to change the VECTOR_MEMORY_SIZE parameter for a pluggable database (PDB) in Oracle Database 23ai. Which SQL command is correct?
Which Python library is used to vectorize text chunks and the user’s question in the following example?
import oracledb

connection = oracledb.connect(user=un, password=pw, dsn=ds)

table_name = "Page"

with connection.cursor() as cursor:
    create_table_sql = f"""
        CREATE TABLE IF NOT EXISTS {table_name} (
            id NUMBER PRIMARY KEY,
            payload CLOB CHECK (payload IS JSON),
            vector VECTOR
        )"""
    try:
        cursor.execute(create_table_sql)
    except oracledb.DatabaseError as e:
        raise

connection.autocommit = True

from sentence_transformers import SentenceTransformer

encoder = SentenceTransformer('all-MiniLM-L12-v2')
When generating vector embeddings for a new dataset outside of Oracle Database 23ai, which factor is crucial to ensure meaningful similarity search results?
What is the significance of splitting text into chunks in the process of loading data into Oracle AI Vector Search?
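For readers unfamiliar with chunking, here is a rough sketch in plain Python. It is not Oracle's chunking utility; the `chunk_text` helper and its `max_chars` limit are hypothetical. It splits text on word boundaries so that each piece stays under a size limit before being embedded.

```python
def chunk_text(text, max_chars=200):
    """Split text into chunks of at most max_chars characters,
    breaking only between words (hypothetical helper)."""
    chunks, current = [], ""
    for word in text.split():
        candidate = word if not current else current + " " + word
        if len(candidate) > max_chars and current:
            # Adding this word would exceed the limit: close the chunk.
            chunks.append(current)
            current = word
        else:
            current = candidate
    if current:
        chunks.append(current)
    return chunks

chunks = chunk_text("alpha beta gamma delta", max_chars=11)
print(chunks)
```

A single word longer than `max_chars` is kept whole in this sketch rather than being split.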
How is the security interaction between Autonomous Database and OCI Generative AI managed in the context of Select AI?
What are the key advantages and considerations of using Retrieval Augmented Generation (RAG) in the context of Oracle AI Vector Search?