INDEX
Explanations
words related to predictions or forecasts
phrases indicating predictions or forecasts
New Auto-Interp
Negative Logits
unal
-0.74
atha
-0.74
tha
-0.73
Bio
-0.72
xia
-0.72
ste
-0.70
zanne
-0.70
tan
-0.69
adra
-0.68
kat
-0.67
POSITIVE LOGITS
predicts
1.15
predicted
1.14
predict
1.03
predictions
1.02
forecasts
0.95
predicting
0.95
prediction
0.93
doom
0.88
Prediction
0.87
forecast
0.84
Activations Density 0.007%