INDEX
Explanations
names of people and titles
New Auto-Interp
Negative Logits
Hornets
-0.65
Brav
-0.64
spoiler
-0.62
âĶĢâĶĢ
-0.61
Slovenia
-0.61
rivals
-0.58
Bahamas
-0.58
Glac
-0.58
Clash
-0.58
ultras
-0.58
POSITIVE LOGITS
Jr
1.14
uez
0.93
icum
0.85
isner
0.84
kowski
0.83
ĸļ
0.82
opoulos
0.82
zinski
0.81
Jr
0.80
III
0.80
Activations Density 0.189%