Explanations
instances of impersonation or imitation
terms related to impersonation and its various forms
Negative Logits
guiActiveUnfocused  -0.73
FFER                -0.71
20439               -0.70
RELEASE             -0.70
DERR                -0.70
Bio                 -0.70
Belt                -0.69
Includes            -0.69
Thrust              -0.68
Streamer            -0.66
Positive Logits
imperson   1.42
spoof      1.00
ating      0.89
ational    0.86
ators      0.84
azon       0.83
ality      0.83
atural     0.82
acy        0.82
ertodd     0.82
Activations Density 0.010%