INDEX
Explanations
expressions of novelty or newness
New Auto-Interp
Negative Logits
<<<<<<<<<<<<<<
-0.85
Autoritní
-0.72
resourceCulture
-0.72
مصادر
-0.68
rungsseite
-0.64
InputDecoration
-0.63
enumi
-0.63
цездатний
-0.62
erenc
-0.60
SequentialGroup
-0.59
POSITIVE LOGITS
unfamiliar
0.73
ksom
0.64
stranger
0.61
stranger
0.60
ismer
0.60
inconn
0.59
desconhe
0.58
isNew
0.58
novedad
0.57
novel
0.55
Activations Density 0.155%