INDEX
Explanations
words related to special focus or concentration on particular issues
New Auto-Interp
Negative Logits
abestanden
-0.82
الحره
-0.80
ſelves
-0.79
prefixer
-0.79
IMPORTED
-0.79
فريبيس
-0.78
ſelf
-0.75
تانيه
-0.72
pihaknya
-0.71
accordingly
-0.70
POSITIVE LOGITS
attention
1.02
EventHandler
0.89
attention
0.80
Attention
0.79
Attention
0.77
neck
0.75
ATTENTION
0.58
atención
0.57
knife
0.55
neck
0.54
Activations Density 0.043%