INDEX
Explanations
phrases related to actions and processes
phrases that describe actions or conditions related to observation and interpretation
New Auto-Interp
Negative Logits
paio
-0.75
ajo
-0.73
iland
-0.68
awar
-0.66
Vaughan
-0.65
oice
-0.64
Lev
-0.63
hov
-0.61
ahoo
-0.60
mar
-0.60
POSITIVE LOGITS
properly
0.87
individually
0.72
correctly
0.65
cule
0.64
scientifically
0.63
separately
0.61
Higher
0.61
});
0.61
geographically
0.60
sufficiently
0.60
Activations Density 0.079%