INDEX
Explanations
numerical values and time-related data
Negative Logits
Token     Logit
Roberts   -0.18
ibur      -0.15
ukan      -0.15
Bundy     -0.15
é¾        -0.15
ترÙĥ      -0.15
inis      -0.15
oad       -0.15
enos      -0.15
Kok       -0.14
Positive Logits
Token     Logit
48        0.52
49        0.40
47        0.39
481       0.30
480       0.30
482       0.29
Highland  0.28
487       0.28
483       0.28
484       0.28
Activations Density 0.042%