Re: [RFC] devel: script to calculate a list of authors.

Subject: Re: [RFC] devel: script to calculate a list of authors.

Date: Wed, 03 Jun 2020 21:02:16 +0300

To: David Bremner, notmuch@notmuchmail.org

Cc:

From: Tomi Ollila


On Wed, Jun 03 2020, David Bremner wrote:

> As an initial heuristic, report anyone with at least 15 lines of code
> in the current source tree. Test corpora are excluded, although
> probabably this doesn't change much about the list of authors
> produced.
> ---
>
> I realized both AUTHORS and debian/copyright are woefully out of
> date. I think it makes sense to keep something like this in the repo,
> both to ease updates and to document a policy. Presuambly 'author '
> should be removed from the output, but I'm guessing Tomi will tear
> this apart anyway ;).

Hi David,

I started doing that before even reading your commit (used less(1) to
look the email from bottom... ;)

I got some idea, but then decided there is no point spending too
many minutes (what was that one 'time management' xkcd again... =D)

anyway, some fun editing the nifty pipeline you write

git ls-files | grep -v -e "$FILE_EXCLUDE" | xargs -n 1 -d \\n | \
    git blame -w --line-porcelain -- | \
    sed -n "/$AUTHOR_EXCLUDE/d; s/^[aA][uU][tT][hH][Oo][rR] //p" | \
    sort -fd | uniq -ic | awk "\$1 >= $THRESHOLD" | sort -nr

If there are more authors or files to exclude, then it is easiest
to just write those out in pipeline, e.g.

grep -v -e 'file-one' -e 'file 2 with spaces' ...

and

sed -n '/author1_ex1/d; /author2_ex1/d; s/^[aA][uU][tT][hH][Oo][rR] //p'

(and, as usual, I always recommend 'set -euf' in shell scripts)

Tomi

>  devel/author-scan.sh | 11 +++++++++++
>  1 file changed, 11 insertions(+)
>  create mode 100644 devel/author-scan.sh
>
> diff --git a/devel/author-scan.sh b/devel/author-scan.sh
> new file mode 100644
> index 00000000..b7b46a33
> --- /dev/null
> +++ b/devel/author-scan.sh
> @@ -0,0 +1,11 @@
> +#!/bin/sh
> +
> +FILE_EXCLUDE='corpora'
> +AUTHOR_EXCLUDE='uncrustify'
> +# based on the FSF guideline, for want of a better idea.
> +THRESHOLD=15
> +
> +git ls-files | grep -v "$FILE_EXCLUDE" |
> +    while read f; do
> +        git blame -w --line-porcelain -- "$f" | grep -I '^author ' | grep -v "$AUTHOR_EXCLUDE"
> +    done | sort -fd | uniq -ic | awk "\$1 >= $THRESHOLD" |  sort -nr
> -- 
> 2.26.2
_______________________________________________
notmuch mailing list
notmuch@notmuchmail.org
https://notmuchmail.org/mailman/listinfo/notmuch

Thread: