INDEX
Explanations
intensifiers that amplify descriptions or attributes
New Auto-Interp
Negative Logits
ded
-0.67
Griffin
-0.66
id
-0.66
проще
-0.66
برانيه
-0.65
Jackman
-0.64
zkod
-0.64
filepath
-0.63
Akhtar
-0.63
reclama
-0.62
POSITIVE LOGITS
very
1.30
Very
1.26
VERY
1.25
VERY
1.25
Very
1.24
very
1.23
sehr
1.01
Molto
0.94
muy
0.92
très
0.92
Activations Density 0.060%