INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     inch
    -0.07
    .");↵
    -0.07
     Horse
    -0.07
    	tree
    -0.06
     truck
    -0.06
     Stick
    -0.06
    _wall
    -0.06
     Truck
    -0.06
    .';↵
    -0.06
     Higgins
    -0.06
    POSITIVE LOGITS
     procur
    0.06
     prefixed
    0.06
    ательно
    0.06
    suffix
    0.06
    pector
    0.06
     Apostle
    0.06
    επ
    0.06
    unicorn
    0.06
    οκ
    0.06
     BorderSide
    0.06
    Act Density 0.001%

    No Known Activations