INDEX
Explanations
phrases related to a specific person
mentions of specific individuals, particularly related to sports
Negative Logits
Catalyst            -0.68
replay              -0.68
Marketable          -0.61
judgment            -0.61
misinformation      -0.61
lihood              -0.60
++++++++++++++++    -0.60
mble                -0.60
Journal             -0.59
Grateful            -0.59
Positive Logits
Poc                 1.10
oco                 0.98
Hots                0.98
hett                0.97
otte                0.94
atell               0.90
het                 0.87
rolet               0.83
jas                 0.82
seys                0.82
Activations Density: 0.006%