INDEX
Explanations
phrases related to action or implementation
instances of words related to actions being properly applied or enacted
Negative Logits
whatever    -0.62
ajo         -0.62
Vaughan     -0.61
Vie         -0.60
Ethiopia    -0.58
Filip       -0.58
feat        -0.56
eah         -0.56
Baal        -0.56
Hop         -0.55
Positive Logits
properly         1.11
correctly        1.00
individually     0.82
perpend          0.74
appropriately    0.73
incorrectly      0.72
improperly       0.70
aloud            0.70
urally           0.69
together         0.69
Activations Density 0.102%