INDEX
Explanations
adjectives related to opinions or criticisms
instances of complex or nuanced opinions and observations
New Auto-Interp
Negative Logits
UF
-0.77
asio
-0.76
swick
-0.74
bilt
-0.73
tz
-0.69
EStream
-0.68
SHIP
-0.68
ioxide
-0.68
aha
-0.66
OME
-0.65
POSITIVE LOGITS
albeit
1.15
but
0.94
though
0.90
however
0.90
although
0.86
huh
0.82
albeit
0.81
whereas
0.80
namely
0.78
except
0.77
Activations Density 0.260%