INDEX
Explanations
expressions related to exaggeration or intensity
intensifiers that convey extreme qualities or states
New Auto-Interp
Negative Logits
ividual
-0.75
tein
-0.73
itu
-0.68
tan
-0.67
hammad
-0.66
OTOS
-0.66
sein
-0.66
Annotations
-0.65
Semitism
-0.65
olan
-0.65
POSITIVE LOGITS
efully
0.81
wildly
0.78
inaccurate
0.76
ishly
0.71
uously
0.69
combust
0.68
impractical
0.67
assi
0.67
exagger
0.65
inco
0.64
Activations Density 0.008%