INDEX
Explanations
words related to an abbreviation "Op" followed by a single character or a number, with activations of 9 or 10
references to a specific type of medication or pharmaceutical agent
New Auto-Interp
Negative Logits
��
-0.70
ãĤ¨ãĥ«
-0.69
BILITY
-0.68
ा
-0.66
waive
-0.64
asher
-0.64
ictional
-0.63
usters
-0.63
guarant
-0.62
rewarded
-0.61
POSITIVE LOGITS
Op
3.94
Op
2.73
op
2.14
OP
1.50
op
1.31
Open
1.22
Ops
1.20
Oper
1.15
OP
1.11
opium
1.10
Activations Density 0.009%