INDEX
Explanations
names of people or characters, potentially with a focus on surnames
specific names and terms related to individuals or entities, particularly focusing on surnames and certain notable references
New Auto-Interp
Negative Logits
ilial
-0.92
boards
-0.68
ously
-0.68
çķ
-0.66
reception
-0.65
ZI
-0.64
士
-0.64
board
-0.64
apter
-0.63
ptin
-0.63
POSITIVE LOGITS
Vik
0.97
ĸļ
0.87
apore
0.87
icious
0.84
Varg
0.79
anova
0.78
Alonso
0.78
eks
0.74
ileaks
0.73
oslav
0.72
Activations Density 0.020%