INDEX
Explanations
words related to a specific topic or entity, potentially associated with sports
references to the UK
Negative Logits
otle      -0.68
OGR       -0.64
poppy     -0.58
Sina      -0.57
Takeru    -0.57
sheet     -0.55
Bland     -0.55
naïve     -0.54
Democr    -0.54
OME       -0.54
Positive Logits
ulkan     1.50
hov       1.36
umar      1.08
htar      1.05
ileaks    1.03
wu        1.02
nown      1.00
raine     0.99
won       0.96
unda      0.95
Activations Density: 0.031%