INDEX
Explanations
quoted speech or reported dialogue
New Auto-Interp
Negative Logits
ніципалі
-0.62
Mata
-0.56
Élet
-0.56
idro
-0.52
Ca
-0.52
omeness
-0.51
Metropolitan
-0.51
opolis
-0.50
PreferredItem
-0.50
zahn
-0.50
POSITIVE LOGITS
featureID
0.97
setVerticalGroup
0.83
CreateTagHelper
0.77
WillAppear
0.66
lisäksi
0.62
кож
0.62
KommentareTeilen
0.58
TNT
0.57
وأضاف
0.56
saffron
0.56
Activations Density 0.202%