INDEX
Explanations
numbers with the suffix 'plus'
phrases indicating a duration of "plus" years or decades
Negative Logits
token         logit
inem          -0.77
arij          -0.69
ayn           -0.68
omach         -0.67
adr           -0.66
ringe         -0.66
rouch         -0.66
icht          -0.65
bane          -0.64
oscope        -0.63
Positive Logits
token         logit
plus          0.98
minus         0.95
--+           0.84
Plus          0.83
ishly         0.80
/-            0.79
------------  0.74
cules         0.73
Scotia        0.73
ãģĨ           0.71
Activations Density: 0.006%