INDEX
Explanations
numerical values
numeric values and related data
New Auto-Interp
Negative Logits
iman
-0.83
kas
-0.82
ige
-0.76
Invaders
-0.75
IG
-0.75
Kik
-0.74
Knight
-0.74
uka
-0.73
igmatic
-0.73
iger
-0.70
POSITIVE LOGITS
10
1.40
10
1.13
1027
0.93
Ten
0.90
1070
0.90
2010
0.90
110
0.89
102
0.89
10000
0.88
1016
0.87
Activations Density 0.155%