INDEX
Explanations
repetition or continual actions within the text
New Auto-Interp
Negative Logits
ariato
-0.64
Spez
-0.61
يتيمه
-0.61
OrCreate
-0.61
closest
-0.59
Caprio
-0.59
heavy
-0.59
upiter
-0.59
͗
-0.58
beaten
-0.58
POSITIVE LOGITS
constantly
0.78
genres
0.70
相
0.61
constamment
0.60
ora
0.59
個
0.58
ро
0.57
uxxxx
0.56
()))
0.55
GOTREF
0.55
Activations Density 0.115%