INDEX
Explanations
phrases indicating an outcome is not very probable
statements or phrases that express improbability or unlikelihood
New Auto-Interp
Negative Logits
ravings
-0.88
Ü
-0.80
lished
-0.77
ocked
-0.76
insula
-0.75
artney
-0.74
ricted
-0.74
CSS
-0.74
aeper
-0.74
aina
-0.74
POSITIVE LOGITS
icably
0.93
theless
0.87
bably
0.84
unlikely
0.76
necessarily
0.72
unanim
0.71
unanimous
0.71
improbable
0.70
coincidence
0.69
necess
0.69
Activations Density 0.019%