INDEX
Explanations
patterns related to predicting future outcomes or probabilities
New Auto-Interp
Negative Logits
atin
-0.80
atri
-0.76
undy
-0.73
unal
-0.72
tha
-0.72
ansk
-0.71
ento
-0.68
untu
-0.66
gur
-0.65
gencies
-0.65
POSITIVE LOGITS
doom
0.89
Prediction
0.83
predictions
0.82
predicts
0.82
predict
0.80
forecasts
0.80
prediction
0.80
pred
0.77
accur
0.74
Predict
0.73
Activations Density 10.886%