INDEX
Explanations
phrases related to evaluating the wisdom or merit of an action
statements regarding the quality of ideas
New Auto-Interp
Negative Logits
andise
-0.73
wood
-0.72
lighting
-0.71
woods
-0.70
vin
-0.66
ighters
-0.66
anches
-0.65
cious
-0.65
aques
-0.63
ancers
-0.63
POSITIVE LOGITS
ually
0.93
proposition
0.68
ALLY
0.64
uristic
0.64
outwe
0.64
aimed
0.63
accelerator
0.63
considering
0.63
Ĥİ
0.62
DEM
0.61
Activations Density 0.026%