ARSeek identifies API resources by jointly analyzing code snippets and natural-language discussions on Stack Overflow. By combining code-based and text-based signals, ARSeek accurately locates the most relevant API documentation and usage examples for a developer query, outperforming approaches that rely on a single modality.
When developers look for API resources, they rely on two complementary information sources: code (showing how an API is called) and discussion (explaining why and when to use it). ARSeek is a retrieval system that jointly exploits both modalities from Stack Overflow to identify the most relevant API resources for a given developer query.
ARSeek consistently outperforms code-only and text-only baselines on the API resource identification benchmark, demonstrating that the two modalities provide complementary signals that are stronger in combination.
Published at: IEEE/ACM International Conference on Program Comprehension (ICPC) 2022 · Citations: 6