    Negative Logits
    " Fairfax"          -0.58
    "Viitteet"          -0.57
    " Springfield"      -0.54
    " umani"            -0.52
    " Prem"             -0.51
    "DataAnnotations"   -0.50
    " nieuws"           -0.50
    " Redmond"          -0.50
    " humanidade"       -0.49
    " Richmond"         -0.49
    Positive Logits
    "boat"              1.38
    "Boat"              1.34
    " Boat"             1.30
    " boat"             1.29
    " BOAT"             1.20
    "Boats"             1.16
    "boats"             1.08
    " Boats"            1.05
    " boats"            1.05
    " Boating"          0.99
    Activation Density: 0.009%

    No Known Activations