INDEX
Explanations
specific mentions of reports
occurrences of the word "report"
New Auto-Interp
Negative Logits
conservancy
-0.78
east
-0.76
vow
-0.63
gate
-0.62
commerce
-0.61
cffff
-0.59
creen
-0.59
fw
-0.58
yg
-0.58
bene
-0.57
POSITIVE LOGITS
orial
0.89
velop
0.87
report
0.82
Crash
0.80
books
0.79
titled
0.76
compiled
0.76
eming
0.75
iry
0.75
emi
0.74
Activations Density 0.056%