Skip to content

How to compute idfs for a custom set of packages #86

Open
@Bisaloo

Description

@Bisaloo

I am trying to compute idfs for a different corpus but I cannot figure out how.

The docs state that it is the output of pkgmatch_bm25()

#' @param idfs Inverse Document Frequency tables for all rOpenSci packages,
#' generated from \link{pkgmatch_bm25}. If not provided, pre-generated IDF
#' tables will be downloaded and stored in a local cache directory.

But the inputs of pkgmatch_bm25() don't match what I would expect here (I would expect the same inputs as pkgmatch_embeddings_from_pkgs()) and the output doesn't seem to match what pkgmatch_similar_pkgs() is expecting anyways.

In other words, if such as function doesn't exist yet, I would like a function pkgmatch_idfs_from_pkgs() which would be the equivalent of pkgmatch_embeddings_from_pkgs() for the idfs argument in pkgmatch_similar_pkgs().

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions