INDEX
Explanations
phrases indicating significant durations or assessments related to time or context
New Auto-Interp
Negative Logits
kasarigan
-0.47
پیوند
-0.44
rü
-0.44
ryt
-0.42
inois
-0.42
arked
-0.41
arrived
-0.41
Unies
-0.39
ypus
-0.39
voorkomen
-0.39
POSITIVE LOGITS
over
1.39
OVER
1.21
Over
1.09
över
1.06
over
1.02
Over
1.01
über
1.01
RegistryLite
0.99
mergeFrom
0.95
над
0.93
Activations Density 0.201%