INDEX
Negative Logits
quantitatively
0.54
dynamism
0.47
constraints
0.47
locomotion
0.46
avier
0.46
development
0.46
nutritive
0.45
cantilever
0.45
effector
0.45
articulation
0.45
POSITIVE LOGITS
scammers
1.71
scam
1.66
scams
1.64
fraudsters
1.47
詐
1.46
fraudulent
1.38
Scam
1.29
诈
1.27
骗
1.26
fraud
1.24
Activations Density 0.040%