INDEX
Explanations
recurring expressions of inevitability or frequency in language
Negative Logits
pis      -0.17
ambre    -0.16
ÏĦαÏĤ    -0.15
ipay     -0.14
odus     -0.14
ero      -0.14
ÙĴس      -0.14
ules     -0.14
})(      -0.14
agh      -0.14
Positive Logits
happens      0.38
happen       0.37
happened     0.35
happening    0.33
occurs       0.33
occur        0.32
aconte       0.31
Happ         0.28
occurred     0.28
ocor         0.28
Activations Density 0.006%