INDEX
Explanations
comparisons between different things or situations
phrases that indicate certainty or the assertion of facts
New Auto-Interp
Negative Logits
Chamberlain
-0.57
umbn
-0.55
airo
-0.52
hoff
-0.51
Watch
-0.50
åij
-0.50
onlook
-0.50
ioch
-0.49
aires
-0.49
ourt
-0.49
POSITIVE LOGITS
impossible
1.09
untrue
0.96
impractical
0.94
useless
0.93
feasible
0.91
omorphic
0.91
advisable
0.91
possible
0.90
unus
0.90
irrelevant
0.88
Activations Density 0.335%