INDEX
Explanations
terms related to geek culture and community building
New Auto-Interp
Negative Logits
redistributed
-0.69
undai
-0.66
irreversible
-0.65
inished
-0.65
utions
-0.64
Liberties
-0.64
Adin
-0.64
miscar
-0.63
Luxem
-0.60
fentanyl
-0.59
POSITIVE LOGITS
hack
1.05
Crunch
1.04
Dad
1.02
hattan
1.01
bench
1.01
haw
0.95
geek
0.93
geist
0.91
core
0.89
y
0.86
Activations Density 0.048%