INDEX
Explanations
acronyms related to various organizations or political discussions
references to formal organizations and their associated acronym names
New Auto-Interp
Negative Logits
answ
-0.85
icter
-0.72
igmatic
-0.72
guiActiveUn
-0.68
iences
-0.67
tremend
-0.67
zers
-0.67
flowering
-0.67
shr
-0.65
rew
-0.65
POSITIVE LOGITS
Committees
0.73
ãĥİ
0.71
اÙĦ
0.68
FF
0.62
Cooperation
0.61
eous
0.61
EY
0.61
â̦)
0.60
)].
0.60
Mines
0.60
Activations Density 0.155%