INDEX
Explanations
proper nouns, specifically names and locations
specific names and references related to individuals or organizations
Negative Logits
saline        -0.74
NXT           -0.72
Juven         -0.67
hips          -0.66
Escape        -0.64
Newfoundland  -0.63
eSports       -0.63
Vikings       -0.63
Wolves        -0.63
cheek         -0.62
Positive Logits
unta          1.01
Aj            0.99
adesh         0.99
ihara         0.97
ibl           0.96
afia          0.96
ulia          0.95
urai          0.93
ihad          0.93
iral          0.91
Activations Density: 0.018%