INDEX
Explanations
contradictory statements or contrasts in the text
New Auto-Interp
Negative Logits
]--;
-0.60
Tiberius
-0.59
Kraken
-0.56
medesimo
-0.55
Kolo
-0.54
coar
-0.54
Baylor
-0.54
bacio
-0.52
helical
-0.52
helico
-0.52
POSITIVE LOGITS
simply
0.84
aarrggbb
0.75
ValueGeneration
0.72
vielmehr
0.72
just
0.71
lenker
0.68
simply
0.64
merely
0.64
انجليز
0.61
Референце
0.59
Activations Density 0.151%