INDEX
Explanations
mentions of "New" followed by varying contexts
New Auto-Interp
Negative Logits
InputDecoration
-0.59
GEBURTSDATUM
-0.57
eleste
-0.56
ClientSize
-0.55
متعلقه
-0.50
czu
-0.50
రి
-0.49
-0.49
arabes
-0.48
Chwiliwch
-0.48
POSITIVE LOGITS
York
0.69
YORK
0.64
York
0.57
york
0.56
YORK
0.55
ork
0.54
Mex
0.53
Vork
0.51
Y
0.51
Yor
0.50
Activations Density 0.201%