INDEX
Explanations
projections or possibilities for future outcomes
words related to potential future events or outcomes
New Auto-Interp
Negative Logits
washer
-0.70
orno
-0.69
Maker
-0.68
core
-0.64
furt
-0.64
cloth
-0.64
raining
-0.62
boy
-0.61
val
-0.59
fashionable
-0.59
POSITIVE LOGITS
feas
1.15
berra
1.08
conce
1.05
adian
1.05
be
0.95
potentially
0.92
hypot
0.91
foresee
0.91
tremend
0.90
theoretically
0.89
Activations Density 0.085%