INDEX
Explanations
references to mimicry or imitation
New Auto-Interp
Negative Logits
jspx
-0.86
disambiguazione
-0.65
ĝis
-0.62
secours
-0.56
portero
-0.55
зик
-0.54
totalSupply
-0.54
volna
-0.54
modb
-0.54
endsection
-0.53
POSITIVE LOGITS
Mim
1.16
mimic
1.15
Mim
1.06
mimics
1.05
imitation
1.03
mimicking
1.01
imitate
1.00
emulate
0.99
imit
0.98
mim
0.98
Activations Density 0.031%