INDEX
Explanations
references to legal terms and actions, especially related to breaking the law
numerical values or figures related to data and statistics
Negative Logits
hog                -0.78
natureconservancy  -0.68
*/(                -0.68
aye                -0.68
conservancy        -0.65
ayette             -0.65
unpre              -0.65
cius               -0.64
pen                -0.64
secretaries        -0.63
Positive Logits
]    1.10
][   0.95
]).  0.91
].   0.89
]"   0.88
]:   0.85
]),  0.84
]    0.83
])   0.83
].   0.81
Activations Density: 0.032%