Explanations
characters or symbols, possibly indicating special formatting or annotation within the text
Negative Logits
Sloven       -0.16
Liverpool    -0.16
Jordan       -0.16
Croatian     -0.16
Slovenia     -0.15
Bolton       -0.15
Napoli       -0.15
uji          -0.15
Milan        -0.15
Croatia      -0.15
Positive Logits
Isaac        0.90
ISA          0.58
Isa          0.56
isa          0.55
ISA          0.52
Newton       0.48
Iss          0.46
isa          0.44
Newton       0.42
Isaiah       0.37
Activations Density 0.007%