INDEX
Explanations
words related to mimicry or imitation
terms related to mimicking or imitation
Negative Logits
Token           Logit
ãĥģ             -0.68
UGE             -0.67
RAW             -0.65
Unified         -0.64
upon            -0.64
Reviewer        -0.63
ULTS            -0.62
hotter          -0.62
Britann         -0.62
Interstitial    -0.62
Positive Logits
Token           Logit
opol            1.02
etic            1.01
icked           0.96
imum            0.92
ety             0.90
ete             0.90
etically        0.89
illian          0.86
icking          0.86
azon            0.86
Activations Density: 0.015%