Abstract: Query-by-Example Spoken Term Detection (QbE-STD) retrieves relevant audio files corresponding to a spoken query, without relying on explicit word-level textual transcriptions. In ...
BoQ is a new architecture for visual place recognition that learns a set of global queries (Bag-of-Queries) to probe the input’s local features via cross-attention, ensuring consistent ...