INDEX
Negative Logits
ymce
-0.84
Ples
-0.79
Egl
-0.77
للمعارف
-0.75
KOM
-0.73
Giz
-0.73
ople
-0.73
Ech
-0.71
Healey
-0.70
bestos
-0.68
POSITIVE LOGITS
Guard
1.41
Guards
1.34
GUARD
1.34
guard
1.26
Guard
1.23
guards
1.20
Guards
1.18
GUARD
1.09
guard
0.97
AuthGuard
0.95
Activations Density 0.011%