INDEX
Explanations
references to time periods or historical events
New Auto-Interp
Negative Logits
omen
-0.18
ukes
-0.16
arth
-0.15
breadth
-0.15
peat
-0.15
lex
-0.15
dó
-0.14
repeated
-0.14
al
-0.14
atis
-0.14
POSITIVE LOGITS
Sesso
0.16
edl
0.15
isphere
0.15
Multiplicity
0.15
rame
0.15
SOLE
0.14
HLT
0.14
lavÃŃ
0.14
hlas
0.14
regor
0.14
Activations Density 0.021%