INDEX
Explanations
references to geek culture and related topics, including science fiction, comic books, and technology
New Auto-Interp
Negative Logits
undai
-0.82
reconc
-0.74
separating
-0.73
Chao
-0.70
OB
-0.70
estinal
-0.67
chlorine
-0.66
inates
-0.65
eele
-0.65
ouf
-0.65
POSITIVE LOGITS
core
1.02
Dad
1.02
arthed
0.99
roots
0.89
community
0.87
adelphia
0.86
fandom
0.83
istani
0.82
Tang
0.82
Leaks
0.81
Activations Density 2.597%