INDEX
Negative Logits
_INTERFACE
-0.15
ite
-0.15
enta
-0.15
Maurice
-0.14
nung
-0.14
aurus
-0.14
249
-0.14
undi
-0.14
eo
-0.13
ihan
-0.13
POSITIVE LOGITS
cake
0.28
fulness
0.27
fully
0.25
FUL
0.21
-tree
0.20
arian
0.20
juice
0.19
å¶
0.19
FULL
0.19
full
0.19
Activations Density 0.012%