INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     geared
    0.96
     seasoned
    0.86
     influenced
    0.85
     taking
    0.81
     fed
    0.81
     def
    0.80
     dominated
    0.79
     computer
    0.79
     securing
    0.78
     caused
    0.77
    POSITIVE LOGITS
    "$
    1.16
    ("
    1.02
    //
    1.02
    $"
    1.02
    {},
    1.00
    //"
    0.98
    "${
    0.97
    "_
    0.96
    $:
    0.93
    "(
    0.92
    Act Density 0.055%

    No Known Activations