NEGATIVE LOGITS
determinar    0.43
considerare   0.42
considere     0.39
доступны      0.38
reag          0.37
considerando  0.36
semnific      0.36
guida         0.36
invigorating  0.36
accessibles   0.35
POSITIVE LOGITS
fake          1.42
fake          1.23
Fake          1.23
Fake          1.22
phony         1.16
偽            1.15
pretended     1.09
deception     1.05
거짓          1.05
جعلی          1.05
Activations Density 0.353%