INDEX
Explanations
verbs that denote actions or processes
gerunds and other verbs that indicate actions or ongoing processes
New Auto-Interp
Negative Logits
eria
-0.74
ana
-0.73
UI
-0.72
youtube
-0.72
eg
-0.70
IE
-0.70
UG
-0.67
ena
-0.67
ocl
-0.67
.–
-0.67
POSITIVE LOGITS
them
0.76
instead
0.72
accordingly
0.70
untold
0.69
homage
0.69
unsus
0.68
nods
0.66
comparisons
0.66
only
0.66
whoever
0.64
Activations Density 0.332%