In addition to natural-language text, a user query can contain code snippets. Currently, codegate computes vector embeddings for the entire user query, including the code snippets, which can lead to poor similarity search results.
codegate should therefore be modified to extract the code snippets from the user query. Any bad packages should be extracted from those snippets using tree-sitter, in the same way as for the output code snippets. Only the natural-language portion of the user query should be embedded and used for similarity search.
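As a rough illustration of the splitting step only, here is a minimal sketch. It assumes snippets arrive as Markdown triple-backtick fenced blocks, which may not cover every client. `split_query` and `FENCE` are hypothetical names, not part of codegate, and the tree-sitter package extraction and the embedding call are deliberately left out.

```python
import re
from typing import List, Tuple

# Assumption: code snippets are wrapped in Markdown triple-backtick fences.
FENCE = "`" * 3

# Group 1: optional language tag after the opening fence.
# Group 2: the snippet body, up to the next closing fence.
FENCE_RE = re.compile(FENCE + r"([\w+-]*)[ \t]*\n(.*?)" + FENCE, re.DOTALL)


def split_query(query: str) -> Tuple[str, List[Tuple[str, str]]]:
    """Split a user query into natural-language text and code snippets.

    Hypothetical helper: returns the text with fenced blocks removed and
    whitespace collapsed, plus a list of (language, code) pairs.
    """
    snippets = [(m.group(1), m.group(2)) for m in FENCE_RE.finditer(query)]
    # Replace each fenced block with a space so adjacent words do not merge.
    text = " ".join(FENCE_RE.sub(" ", query).split())
    return text, snippets


if __name__ == "__main__":
    query = (
        "How do I fix this?\n"
        + FENCE + "python\nimport requests\n" + FENCE
        + "\nIt fails on startup."
    )
    text, snippets = split_query(query)
    print(text)      # the part that would be embedded
    print(snippets)  # the part that would go to package extraction
```

With a split like this, only `text` would be passed to the embedding step, while each `(language, code)` pair could be handed to the existing tree-sitter based package extraction.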
Related to #475