New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Sorting giving incorrect results #30691
Comments
Is this repeatable? Is coverage 100% in both cases? |
How do I set it to 1.0 without setting any value for max-hits-per-partition?
|
@bratseth Coverage is 100%, I have shared response with trace level with you over secure channel |
This works just fine:
|
@nehajatav vespa-get-config -n vespa.config.search.dispatch -i feed/component/dispatcher.<insert content cluster name here> | grep topKProbability There are also slightly different count for total number documents in the two dumps: The dumps provided indicates that the top-k setting has not been correctly propagated. There is a slightly skew in the distribution of hits, with the node 2 reporting more hits than 0 and 1. The slight change in ordering was caused by additional hits from node 2 that have a lexical ordering lower than the highest in the 3k dump. |
@bjorncs the total count may be due to increasing docs in the cluster |
@nehajatav
You can use
Use the output to determine the exact arguments to |
Describe the bug
Query1: The below gives 4000 lexid sorted by id field as expected
{ "hits" : 0, "model.searchPath" : "/0", "yql" : "select '[docid]' from sources * where !(range( myDate_t, -Infinity, Infinity )) AND (range(date,960892397000,1085589357000)) order by '[docid]' asc limit 4000 offset 0", "timeout" : "120s" }
Query2: The below gives 3000 lexid sorted by id field
{ "hits" : 0, "model.searchPath" : "/0", "yql" : "select '[docid]' from sources * where !(range( myDate_t, -Infinity, Infinity )) AND (range(date,960892397000,1085589357000)) order by '[docid]' asc limit 3000 offset 0", "timeout" : "120s" }
Below are the observations that we dont expect with default top-k-probability
Expected behavior
Ranking should nearly be the same for both queries
Environment (please complete the following information):
Vespa version
8.221.29
The text was updated successfully, but these errors were encountered: