INDEX
Explanations
initials or abbreviations that include 'OD' with increasing activation values
references to the abbreviation "OD" related to various contexts
New Auto-Interp
Negative Logits
Pry
-0.74
Welsh
-0.71
Athena
-0.67
Percy
-0.67
Pens
-0.65
Ki
-0.65
Tate
-0.64
Lerner
-0.64
Schne
-0.64
Weston
-0.64
POSITIVE LOGITS
IUM
1.13
OME
1.05
ependent
1.02
ODUCT
0.98
CAST
0.95
MAP
0.94
OD
0.92
irect
0.92
DEN
0.92
gins
0.90
Activations Density 0.014%