INDEX
Explanations
words related to the concept of "rogue"
New Auto-Interp
Negative Logits
birth
-0.65
goodbye
-0.62
Fas
-0.60
AAP
-0.60
aved
-0.59
Machina
-0.55
spaced
-0.55
shorth
-0.54
Fey
-0.54
Wasserman
-0.54
POSITIVE LOGITS
raphic
1.42
raphics
1.28
raph
1.21
allery
1.10
roup
1.09
aming
1.02
ressive
0.99
rams
0.99
roups
0.98
rowth
0.97
Activations Density 0.018%