INDEX
Explanations
the words 'like', 'only', and 'both'
New Auto-Interp
Negative Logits
religieuse
-0.59
asmen
-0.56
étrangère
-0.56
indépendante
-0.54
lettura
-0.54
manquante
-0.54
voks
-0.54
réduite
-0.53
ifrån
-0.52
barrera
-0.52
POSITIVE LOGITS
GraphicsUnit
0.80
AxisAlignment
0.76
InjectAttribute
0.75
AutoresizingMask
0.67
UserScript
0.66
NameInMap
0.63
>=",
0.63
>",
0.63
spacy
0.62
TagMode
0.61
Activations Density 0.283%