INDEX
Explanations
names or terms with a particular pattern of letters or syllables
frequently occurring abbreviations, acronyms, or shorthand notations
New Auto-Interp
Negative Logits
è¦ļéĨĴ
-0.78
èĢħ
-0.74
eleph
-0.73
govtrack
-0.72
ħĭ
-0.72
Ranked
-0.71
DRAG
-0.70
Ͻ
-0.70
ãĥ¼ãĥĨ
-0.69
querque
-0.68
POSITIVE LOGITS
lich
0.89
ewater
0.85
itzer
0.79
kered
0.79
emaker
0.77
atever
0.77
nit
0.75
eworks
0.75
ework
0.74
bley
0.74
Activations Density 0.068%