Function to set Language field #2058

Open
llrs opened this issue Sep 15, 2024 · 2 comments

Comments

@llrs
Contributor

llrs commented Sep 15, 2024

It would be great if usethis could help developers specify the language of their packages. Currently this is done in only 9% of packages, and only 3% are in other languages. Having tooling to set this field would be great.
According to WRE:

A ‘Language’ field can be used to indicate if the package documentation is not in English: this should be a comma-separated list of standard (not private use or grandfathered) IETF language tags as currently defined by RFC 5646 (https://www.rfc-editor.org/rfc/rfc5646, see also https://en.wikipedia.org/wiki/IETF_language_tag), i.e., use language subtags which in essence are 2-letter ISO 639-1 (https://en.wikipedia.org/wiki/ISO_639-1) or 3-letter ISO 639-3 (https://en.wikipedia.org/wiki/ISO_639-3) language codes.

Is there interest in a PR to add this?

I think it would be easy to add a basic check that the language is well formatted (nchar >= 2 && lower case), but an ideal solution would check against the official rules (I don't know how easy that would be).
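The basic check could look roughly like the sketch below. `is_plausible_language()` is a hypothetical helper, not an existing usethis function, and the rule it applies (a primary subtag of 2 or 3 lowercase letters, with anything after the first hyphen ignored) is a simplifying assumption rather than full RFC 5646 validation.

```r
# Hypothetical helper (not part of usethis): a cheap plausibility check.
# Assumption: only the primary subtag is inspected, and it must be
# 2 or 3 lowercase ASCII letters (ISO 639-1 / ISO 639-3 style).
is_plausible_language <- function(x) {
  primary <- sub("-.*$", "", x)  # keep the text before the first hyphen
  grepl("^[a-z]{2,3}$", primary)
}

is_plausible_language(c("ca", "es", "en-US", "EN", "c"))
```

This accepts tags such as "en-US" while rejecting an upper-case or one-letter primary subtag; it does not confirm that the code is actually registered.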

A rough draft of the behavior would be something like:

use_language(c("ca", "es"))
ℹ Adding ca as language
ℹ Adding es as language
use_language("ca")
ℹ ca language is already present
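
To make that draft concrete, here is a minimal sketch using only base R (`read.dcf()` and `write.dcf()`). The name `use_language_sketch()` and its `path` argument are assumptions for illustration; a real implementation in usethis would presumably go through its own DESCRIPTION helpers and cli messages instead. Note that rewriting the file with `write.dcf()` may reflow other fields.

```r
# Hypothetical sketch, not the usethis implementation.
# Adds language tags to the Language field of a DESCRIPTION file.
use_language_sketch <- function(languages, path = "DESCRIPTION") {
  desc <- read.dcf(path)  # 1-row character matrix, fields as column names
  has_field <- "Language" %in% colnames(desc)

  existing <- character(0)
  if (has_field && !is.na(desc[1, "Language"])) {
    # The field is a comma-separated list of tags
    existing <- trimws(strsplit(desc[1, "Language"], ",", fixed = TRUE)[[1]])
  }

  for (lang in languages) {
    if (lang %in% existing) {
      message(lang, " language is already present")
    } else {
      message("Adding ", lang, " as language")
      existing <- c(existing, lang)
    }
  }

  value <- paste(existing, collapse = ", ")
  if (has_field) {
    desc[1, "Language"] <- value
  } else {
    desc <- cbind(desc, Language = value)
  }
  write.dcf(desc, file = path)
  invisible(existing)
}
```

Calling it twice with the same tag leaves the field unchanged, which matches the second message in the draft.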
@llrs
Contributor Author

llrs commented Sep 17, 2024

I got some feedback asking why one would need such a function when create_package already shows how to set up Language. My reasoning is that, in both old and new packages, development sometimes happens in one language and another language is added after the package is created.

For example, data.table is developed in English and doesn't have the Language field. But it also has error messages translated to Spanish, Mandarin and Portuguese, and this is not shown in the DESCRIPTION. Showing these languages would help users find and use the translations.

@jonthegeek
Contributor

It would be nice to apply some simple formatting rules to at least standardize the types of things that already appear in tools::CRAN_package_db() |> dplyr::count(Language, sort = TRUE). For example, all of these real examples would ideally become "en-US":

  • 'en-US'
  • EN-US
  • en-Us
  • en-us
  • en_US
  • en-USx (maybe, but this one might at most get flagged as weird)
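
A rough sketch of that kind of normalization, in base R. `normalize_language_tag()` is a hypothetical name, and the rules (strip surrounding quotes, turn underscores into hyphens, lowercase the primary subtag, uppercase any two-letter subtag after it) are assumptions chosen to cover the examples above, not a full RFC 5646 canonicalizer.

```r
# Hypothetical helper: standardize the casing and separators of language tags.
normalize_language_tag <- function(x) {
  x <- trimws(x)
  x <- gsub("^['\"]+|['\"]+$", "", x)   # drop stray surrounding quotes
  x <- gsub("_", "-", x, fixed = TRUE)  # en_US -> en-US
  parts <- strsplit(x, "-", fixed = TRUE)
  vapply(parts, function(p) {
    if (length(p) == 0) return(NA_character_)
    rest <- p[-1]
    # Assumption: a two-letter subtag after the first is a region code
    is_region <- grepl("^[A-Za-z]{2}$", rest)
    rest[is_region] <- toupper(rest[is_region])
    paste(c(tolower(p[1]), rest), collapse = "-")
  }, character(1))
}

tags <- c("'en-US'", "EN-US", "en-Us", "en-us", "en_US", "en-USx")
normalized <- normalize_language_tag(tags)

# Flag anything that still does not look like "xx" or "xx-YY" (simplified rule)
is_odd <- !grepl("^[a-z]{2,3}(-[A-Z]{2})?$", normalized)
```

With these rules the first five examples collapse to the same tag, while "en-USx" is left as it is and ends up flagged by `is_odd` rather than silently rewritten.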
