INDEX
Explanations
prepositions used in combination with action verbs
phrases indicating conditional or dependent relationships
Negative Logits
ryu            -0.95
thood          -0.77
erson          -0.74
Liberties      -0.69
odox           -0.68
listed         -0.66
ãģ«            -0.65
encer          -0.65
eson           -0.65
istani         -0.64
Positive Logits
seams          1.30
edges          0.85
cracks         0.79
fumes          0.78
uncontroll     0.75
sudden         0.73
exhaustion     0.72
ankles         0.72
acidic         0.71
incompetence   0.71
Activations Density: 0.442%