INDEX
Explanations
verbs followed by a subject performing an action
the word "as" in various contexts
Negative Logits
iosyncr       -0.71
roads         -0.69
ktop          -0.67
Forge         -0.66
iterranean    -0.65
MY            -0.64
Ess           -0.64
iqueness      -0.63
appropri      -0.62
Intern        -0.62
Positive Logits
they          1.02
phy           0.98
she           0.94
he            0.94
leep          0.89
usual         0.88
bestos        0.86
dusk          0.85
if            0.85
soever        0.84
Activations Density 0.113%