INDEX
    Explanations

    frequently occurring letters in the text

    New Auto-Interp
    Head Attr Weights
    0:0.03
    1:0.02
    2:0.19
    3:0.08
    4:0.08
    5:0.05
    6:0.24
    7:0.02
    8:0.05
    9:0.08
    10:0.06
    11:0.04
    Negative Logits
     headlights
    -1.55
    imeters
    -1.50
     scissors
    -1.32
     magnets
    -1.20
     cloaked
    -1.18
     hardness
    -1.17
     Jinn
    -1.14
     grit
    -1.14
     Lithuan
    -1.11
     evenly
    -1.09
    POSITIVE LOGITS
    rar
    1.51
    ヴァ
    1.50
    anto
    1.39
    intend
    1.37
    esta
    1.32
    1.30
    avan
    1.29
    imate
    1.29
    1.28
    UTE
    1.28
    Act Density 0.009%

    No Known Activations