INDEX
Explanations
references to whistleblowers and the act of blowing the whistle
references to whistleblowers and the act of blowing the whistle on a situation or wrongdoing
New Auto-Interp
Negative Logits
itating
-0.73
itates
-0.73
itals
-0.71
itated
-0.69
ists
-0.67
ter
-0.66
Urban
-0.66
ivan
-0.66
ggles
-0.65
Ͻ
-0.65
POSITIVE LOGITS
whistle
1.16
dropping
0.94
whist
0.84
tower
0.80
vernment
0.79
keeper
0.78
sonian
0.75
sounded
0.75
whistlebl
0.74
backs
0.74
Activations Density 0.042%