INDEX
Explanations
references to real-life examples or contexts, perhaps related to educational or science-related scenarios
New Auto-Interp
Negative Logits
aido
-0.73
edin
-0.72
xit
-0.72
ressed
-0.70
wagon
-0.70
ressive
-0.70
indal
-0.69
alos
-0.68
pid
-0.68
ervative
-0.68
POSITIVE LOGITS
scenarios
0.88
equivalents
0.82
situations
0.82
examples
0.80
counterparts
0.78
counterpart
0.77
scenario
0.76
embodiment
0.76
occurrences
0.73
experience
0.72
Activations Density 0.043%