INDEX
Explanations
phrases that denote conditions or requirements involving actions or roles
Negative Logits
arga    -0.17
093     -0.15
lei     -0.15
佳      -0.15
157     -0.15
oni     -0.14
rott    -0.14
ora     -0.14
ule     -0.14
enda    -0.14
Positive Logits
activex   0.17
zel       0.15
річ       0.15
alace     0.15
owski     0.15
GBT       0.15
arro      0.15
付        0.15
ingleton  0.14
baar      0.14
Activations Density 0.390%