Kingfisher Summarize#

Read the Kingfisher Summarize documentation, which covers general usage.

Note

Is the service unresponsive or erroring? Follow these instructions.

Summarize collections#

  1. Connect to the data support server

  2. Open a session in tmux:

    tmux new -s initials-task-description
    
  3. Run the command-line tool, using the /opt/kingfisher-summarize.sh entrypoint. (This runs manage.py indirectly.) For example:

    sudo -u summarize /opt/kingfisher-summarize.sh add --help
    
    sudo -u summarize /opt/kingfisher-summarize.sh add 123 "Added by NAME"
    

    Tip

    Use the --field-lists option to analyze coverage:

    sudo -u summarize /opt/kingfisher-summarize.sh add 123 "added by NAME" --field-lists
    
  4. Close the session by pressing Ctrl-d or by running:

    exit
    

Review log files#

Kingfisher Summarize writes log messages to the /var/log/kingfisher-summarize.log file. The log file is rotated weekly; last week’s log file is at /var/log/kingfisher-summarize.log.1, and earlier log files are compressed at /var/log/kingfisher-summarize.log.2.gz, etc.

Log messages are formatted as:

[date] [hostname] %(asctime)s - %(process)d - %(name)s - %(levelname)s - %(message)s

You can filter messages by topic. For example:

grep NAME /var/log/kingfisher-summarize.log | less

For more information on the topics of log messages, read Kingfisher Summarize’s logging documentation.

Data retention policy#

On the first day of each month, the following are deleted:

  • Schema whose selected collections no longer exist

To protect a schema from deletion, edit the KINGFISHER_SUMMARIZE_PROTECT_SCHEMA environment variable in the salt/kingfisher/summarize/files/.env file.