INDEX
Explanations
phrases related to physical attributes and actions
negative descriptions or characteristics of objects or situations
Negative Logits
Democracy     -0.72
national      -0.70
Ethics        -0.70
iberal        -0.69
Liberal       -0.68
democracy     -0.67
Government    -0.65
Liberal       -0.65
Nation        -0.65
government    -0.64
Positive Logits
moisture      0.80
texture       0.80
waterproof    0.76
textures      0.75
noticeable    0.75
rotate        0.74
diagonal      0.74
airflow       0.74
tactile       0.73
removable     0.73
Activations Density 2.265%