Local cross-encoder rerank service using BAAI/bge-reranker-base model via michaelf34/infinity:0.0.68. Serves /rerank endpoint on port 7998 for LiteLLM proxy integration. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>