INDEX
    Explanations

    characters and relationships in storytelling contexts

    Negative Logits
    dio            -0.18
    odash          -0.17
    /includes      -0.15
    .newBuilder    -0.15
    jab            -0.15
    .LENGTH        -0.14
    ebin           -0.14
    poke           -0.14
    metro          -0.14
    metros         -0.14

    Positive Logits
     teams          0.24
     soon           0.24
     discovers      0.23
     encounters     0.22
     discover       0.20
     aw             0.20
     learns         0.20
     emb            0.19
     unc            0.19
     faces          0.19
    Act Density 0.138%

    No Known Activations