INDEX
Explanations
the expression "more or less"
phrases that suggest a degree of approximation or uncertainty
Negative Logits
osa        -0.70
ller       -0.70
Burgess    -0.68
Sands      -0.68
hower      -0.67
Seat       -0.63
amsung     -0.62
Mitchell   -0.62
icro       -0.61
ividual    -0.60
Positive Logits
etheless      0.87
aneously      0.73
ingu          0.67
desirable     0.67
likely        0.67
superficial   0.66
than          0.66
mileage       0.64
aggro         0.63
imate         0.63
Activations Density 0.019%