INDEX
Explanations
phrases related to self-interest and social issues
complex concepts related to self-interest and interdependence in societal contexts
New Auto-Interp
Negative Logits
tours
-0.84
braces
-0.82
packs
-0.80
ambassadors
-0.80
racks
-0.79
cleaners
-0.79
tourists
-0.79
rentals
-0.78
trailers
-0.77
interns
-0.75
POSITIVE LOGITS
existing
1.38
intuitive
1.25
rational
1.23
dimensional
1.22
linear
1.19
context
1.18
defined
1.16
ministic
1.14
functional
1.13
hist
1.12
Activations Density 0.255%