INDEX
Explanations
numerical values indicating intensity or significance
New Auto-Interp
Negative Logits
seven
-0.71
Seven
-0.69
SEVEN
-0.68
七
-0.68
seven
-0.67
VII
-0.66
seventh
-0.65
Personensuche
-0.64
七
-0.63
Bourgoin
-0.62
POSITIVE LOGITS
0.62
4
0.62
ivelany
0.57
ConstraintMaker
0.54
4
0.53
čty
0.52
0.51
annica
0.49
čtyř
0.48
keempat
0.48
Activations Density 0.057%