forever-chat-format

v1.2.1

Published

3 years ago

Validators for a chat exporter to ensure data conforms to forever chat format standards

Downloads

0High
0Medium
0Low

jeffkeen

Forever Chat Format

Why

Because in the world where our communication is siloed into Google Hangouts, Facebook, iMessage, WhatsApp, etc etc it's getting hard to hold onto conversations for archival purposes. First thing we need is a universal format that we can export those proprietary chats into that's easy to understand, and that can be relied upon.

Basic Format

Output should be an array of hashes, and each object in the array is a JSON object defined as follows:

Chat Object Definition

Name | Type | Description ----------- | ------- | ------------- sha | string | sha1 of [sender, receiver, date, text, service] sender | string | The handle of the person sending the message receiver | string | The handle of the person receiving the message is_from_me | boolean | true if you sent the message send_error | boolean | true if the message didn't send date | ISO-8601 datetime | date message was sent/received by sender service | string | Service message was sent using. All lowercase, dasherized. 'imessage', 'google-hangouts', 'facebook-messenger', 'sms' date_read | ISO-8601 datetime | (optional) date message was sent/read by receiver date_delivered | ISO-8601 datetime | (optional) date message was delivered by receiver participants | array | (see definition below) An array of strings of handles involved in the conversations. Used for grouping. For a two person conversation, this has two entries in it. Sender and receiver are always here. message_text | string | text representation of the message message_segments | array | (see definition below) ordered segments of content as it appears in the message attachments | array | (see definition below) array of attachments associated_sha | string | sha of message this message is in response to. Generally used for reactions _debug | object | (optional) debug information to help trace back to the source of the data

Message Segment Definition

Name | Type | Possible Values | Description ----------- | ------- | ----------------- | ------------- type | string | text, link, file | base segment type file_type | string | null, or a mime_type | only valid when type == file path | string | url or file path |
text | string | | the rendered name of the link, or the text content for the text, or optionally

Text Segment

{"type": "text", "text": "Why hello there"}

Link Segment

{"type": "link", "text": "Whoa, unreal", "path": "http://latlmes.com/technology/why-we-need-a-new-chat-format-1"}

File Segment

{"type": "file", "file_type": "image/png", "path": "/path/to/image.png"}

Reaction Segment

{"type": "reaction", "reaction_type": "laughed"}

Attachments

Array of attachments in the order they were received. Each attachment has the following properties. This information should also be made available in message_segments.

Name | Type
----------- | -------
path | string
type | string

Usage

The included tests check whether some json is valid forever chat format.

var foreverJson    = importer("test.json"); // done by an exporter

var {validate, runTests} = require('forever-chat-format-tests');

runTests(foreverJson); // runs mocha tests on data

The last step of your exporter should be to pass your exported data to the prepare function. It will check that the data conforms to the forever-chat-format spec and then return a hash with two keys: messages, and validations. Messages is all the data your exporter returned, and validations are what you see below:


"validations": {
  "version":           1.2 // forever chat version
  "checkedCount":      100 // number of records checked
  "errorCount":        0   // number of records with errors
  "recordsWithErrors":
  "ruleResults": { /* each rule keyed by name with errorCount and "erroredRecords" */
    "unique-sha": {
      "description":    "Each record should have a unique sha",
      "errorCount":     0,
      "erroredRecords": []
    }

    ...
  }
}

Testing

var foreverJson   = importer("test.json"); // done by an exporter
var {validate, runTests} = require('forever-chat-format-tests');
runTests(foreverJson);