INDEX
Explanations
phrases related to significant past events or historical contexts
New Auto-Interp
Negative Logits
seamnă
-0.75
OGND
-0.69
Etc
-0.66
itp
-0.65
etc
-0.63
liksom
-0.63
Etc
-0.62
inclusief
-0.61
المعيارى
-0.60
plus
-0.58
POSITIVE LOGITS
:
0.89
:
0.83
:</
0.75
$:
0.70
:
0.70
’:
0.70
`:
0.69
':
0.68
:")
0.65
":
0.65
Activations Density 0.591%