INDEX
Negative Logits
AnchorStyles
-0.67
ever
-0.58
trị
-0.58
weist
-0.58
orum
-0.57
ω
-0.56
mik
-0.54
Erm
-0.54
Conan
-0.54
?[
-0.53
POSITIVE LOGITS
tobacco
1.08
Tobacco
1.05
Tobacco
1.03
cigarettes
0.85
purpoſe
0.82
neceff
0.81
Monfieur
0.80
nicotine
0.78
ſtate
0.77
SEGUIR
0.77
Activations Density 0.004%