I have a collection test
, and a compound index on it with two fields
db.test.createIndex({ i: 1, j: 1 })
When I execute following pipeline
db.test.aggregate([{ $sort: { i: 1, j: 1 } }], { allowDiskUse: false })
it works fine. But this pipeline
db.test.aggregate([{ $sort: { i: 1, j: -1 } }], { allowDiskUse: false })
fails with the error that says "Sort exceeded memory limit". The reason is more less clear. The sort order in the pipeline does not match the order in the index and therefore mongodb decides not to use the index and sort the whole collection, which, in turn, does not fit in memory.
However I suspect that mongodb could be slightly smarter. Instead of sorting the whole collection it could use the index to delimit blocks of documents, for which field i
is the same, and then sort documents only within such blocks. The documents of the same block have more chances to fit in memory and therefore the pipeline can perform more efficiently. Can I make mongodb server do so? How? If not, what prevents this.
2
Answers
It seems mongod do not identify that can use the index , but you can try to hint him as follow:
A similar question was asked a few days later here. As @Tom Slabbaert mentioned in the comments, the answer is that no, at the time of writing, MongoDB does not appear to support using the index in the situation described to provide an incremental sort. There is no (non-hacky) way to force the system to do this, especially in a way that would be flexible and deliver performance benefits.
Some additional things to consider with respect to the presumed goal of improved performance:
j
in descending order to allow it to support the sort)?allowDiskUse
set tofalse
. Is there a reason for that? Setting it totrue
should allow the operation to complete successfully.allowDiskUse
now defaults totrue
beginning in version 6.0.Edit: Per the comments, the request for this functionality in MongoDB appears to be tracked either here or here.