INDEX
Explanations
superlatives or extreme comparisons
phrases indicating likelihood or probability
New Auto-Interp
Negative Logits
Palace
-0.63
asar
-0.61
instead
-0.61
Recording
-0.59
utions
-0.58
kus
-0.56
clusive
-0.56
NCT
-0.56
Chips
-0.55
rompt
-0.54
POSITIVE LOGITS
likely
1.19
definitely
1.17
certainly
1.14
importantly
1.09
prominently
1.07
likely
1.03
notably
1.00
commonly
0.97
assured
0.97
Likely
0.93
Activations Density 0.044%