Explanations
phrases indicating a solution, action, or method to address an issue
prepositions indicating agency or means in actions
Negative Logits
ational     -0.77
itto        -0.68
upon        -0.64
imental     -0.64
bian        -0.64
ertain      -0.63
kson        -0.63
asy         -0.63
>[          -0.62
ginx        -0.62
Positive Logits
virtue       1.30
laws         1.10
products     0.97
fiat         0.85
gone         0.85
proxy        0.83
leaps        0.81
akuya        0.79
catch        0.79
multiplying  0.79
Activations Density: 0.164%