INDEX
Explanations
questions related to environmental and ecological impacts
New Auto-Interp
Negative Logits
BX
-0.23
BK
-0.22
BR
-0.21
BV
-0.20
BL
-0.20
BH
-0.19
BU
-0.19
BW
-0.19
/B
-0.18
BI
-0.18
POSITIVE LOGITS
brief
0.37
bother
0.34
beginner
0.34
beginners
0.33
budget
0.33
briefed
0.32
bigotry
0.32
broadcast
0.31
briefing
0.31
boolean
0.31
Activations Density 0.310%