INDEX
Explanations
phrases that refer to similarities or comparisons between different situations, especially when discussing historical events or political situations
New Auto-Interp
Negative Logits
*=-
-0.91
ase
-0.88
ä¿
-0.85
Frag
-0.83
icum
-0.80
76561
-0.79
Interstitial
-0.79
ases
-0.78
usa
-0.78
pac
-0.77
POSITIVE LOGITS
thing
1.19
exact
1.16
vein
1.13
amount
0.99
kind
0.99
fate
0.98
playbook
0.96
reason
0.91
principle
0.91
sort
0.91
Activations Density 6.375%