INDEX
NEGATIVE LOGITS
DC              -0.86
dc              -0.64
DC              -0.61
Delhi           -0.59
cur             -0.56
unknownFields   -0.54
aarrggbb        -0.52
DB              -0.50
db              -0.50
autorytatywna   -0.50
POSITIVE LOGITS
ſelves          0.80
Monfieur        0.73
pleaſure        0.72
varandra        0.71
itſelf          0.71
myſelf          0.71
bibfield        0.68
čierna          0.68
purpoſe         0.67
iſt             0.66
Activations Density 0.170%