INDEX
Explanations
keywords related to making decisions or policies
instances of the word "adopt" in various forms and contexts
New Auto-Interp
Negative Logits
ibur
-0.66
othal
-0.65
¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯
-0.65
isal
-0.64
OIL
-0.63
istg
-0.62
hole
-0.61
oola
-0.59
imir
-0.58
Templ
-0.58
POSITIVE LOGITS
adopt
0.85
adopted
0.84
adopting
0.76
ively
0.74
Viper
0.72
adoption
0.72
irtual
0.68
mits
0.67
ivating
0.66
phrase
0.64
Activations Density 0.021%