INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     AssemblyCompany
    -0.44
    hyrchwyd
    -0.44
    Coordonnées
    -0.37
    verständnis
    -0.34
    rías
    -0.33
    Награды
    -0.32
     Олег
    -0.32
    nodeList
    -0.32
     Tob
    -0.31
    κέ
    -0.31
    POSITIVE LOGITS
    fire
    3.27
    Fire
    2.81
     fire
    2.77
    FIRE
    2.70
     Fire
    2.58
     FIRE
    2.42
    fires
    2.36
     fires
    2.02
    fired
    1.98
     Fires
    1.80
    Act Density 0.007%

    No Known Activations