INDEX
Negative Logits
Rossetti
-0.75
Neto
-0.73
Yates
-0.71
Wentworth
-0.69
Whistle
-0.69
knocking
-0.68
Guthrie
-0.68
Zidane
-0.68
WriteBarrier
-0.68
ită
-0.67
POSITIVE LOGITS
Flav
1.42
Flav
1.34
flav
1.21
flavors
1.14
flavours
1.12
flavour
1.07
flavor
1.06
flavon
1.05
flavor
1.00
Flavor
1.00
Activations Density 0.005%