INDEX
Explanations
instances of repetitive phrases or expressions, particularly those involving actions and social interactions
New Auto-Interp
Negative Logits
abox
-0.07
airy
-0.07
umas
-0.07
ç¸
-0.07
iani
-0.07
عÙħÙĦÛĮ
-0.07
thur
-0.07
trap
-0.07
ObjectContext
-0.07
_atomic
-0.06
POSITIVE LOGITS
occasionally
0.07
subs
0.07
eventual
0.07
vie
0.06
occ
0.06
talk
0.06
ev
0.06
enjoy
0.06
pass
0.06
hoping
0.06
Activations Density 0.039%