INDEX
Explanations
actions and processes, particularly those involving interaction or change
Negative Logits
häl           -0.54
quæ           -0.54
schrank       -0.52
dieux         -0.52
))^           -0.50
perfección    -0.49
houſe         -0.49
område        -0.49
)))))         -0.48
gæ            -0.47
Positive Logits
ing           1.09
ING           1.05
ting          0.95
ating         0.91
ging          0.91
ering         0.88
ding          0.87
ning          0.86
ening         0.86
aming         0.85
Activations Density: 1.639%
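
For orientation, an activation density figure like the one above is commonly read as the fraction of token positions on which the feature fires. The sketch below illustrates that reading only; the function name `activation_density`, the zero threshold, and the toy data are assumptions for illustration, not the dashboard's actual computation.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of token positions where the
    feature is active. The real dashboard may define it differently.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical toy data: 1000 token positions, 16 of them active.
toy = [0.0] * 984 + [0.7] * 16
density = activation_density(toy)
print(f"{density:.3%}")
```

Under this reading, a density of 1.639% would mean the feature is active on roughly 16 of every 1000 sampled tokens.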