INDEX
Negative Logits
Henry
-0.08
-May
-0.08
').'
-0.08
yachts
-0.08
Fees
-0.08
Electric
-0.08
Serenity
-0.08
laptops
-0.08
vam
-0.07
fees
-0.07
POSITIVE LOGITS
learned
0.14
learnt
0.12
appris
0.12
learns
0.12
learn
0.11
Learned
0.11
aprendido
0.11
receptive
0.10
halluc
0.10
सीख
0.10
Activations Density 0.021%