INDEX
Explanations
the word "Rogue" or variations of it
the term "rogue" and its variations in different contexts
New Auto-Interp
Negative Logits
ori
-0.80
atsu
-0.77
ieu
-0.77
drops
-0.76
rix
-0.76
emporary
-0.75
iences
-0.73
odium
-0.71
ulty
-0.70
urity
-0.70
POSITIVE LOGITS
^^^^
0.79
Trader
0.74
trooper
0.71
rogue
0.71
esses
0.69
Squadron
0.68
trader
0.68
hun
0.67
thumb
0.66
rend
0.63
Activations Density 0.022%