INDEX
Explanations
keywords related to openness and open environments
Negative Logits
Opening     -0.25
Opening     -0.24
opening     -0.23
opening     -0.20
opener      -0.20
-opening    -0.19
opened      -0.18
Opens       -0.18
opens       -0.17
opens       -0.17
Positive Logits
-ended      0.38
-air        0.33
ended       0.32
ended       0.30
Ended       0.27
Ended       0.27
-source     0.25
baar        0.25
air         0.24
-plan       0.24
Activations Density: 0.031%