INDEX
Explanations
names, particularly of individuals
social media handles or usernames
Negative Logits
Token      Logit
Lis        -0.83
XIII       -0.80
Vi         -0.79
hack       -0.76
isphere    -0.76
iso        -0.74
Forth      -0.74
HF         -0.73
yrinth     -0.72
okane      -0.72
Positive Logits
Token      Logit
Brown      2.80
Brown      2.72
brown      2.08
brown      1.83
Browne     1.78
Browns     1.43
Gray       1.18
Redd       1.17
Gray       1.14
Yellow     1.08
Activations Density: 0.105%