INDEX
Negative Logits
Squirrel
-0.77
Melville
-0.76
Barnet
-0.74
mybatisplus
-0.73
septum
-0.72
Kela
-0.72
Marta
-0.72
Barrington
-0.72
Packard
-0.72
ddelweddau
-0.71
POSITIVE LOGITS
Ele
0.73
كويكب
0.61
Chal
0.60
charge
0.59
honours
0.58
Charge
0.57
BoxShadow
0.57
embar
0.57
bicara
0.57
Pras
0.56
Activations Density 1.808%