INDEX
Explanations
concepts related to meaning, life narratives, and the impact of actions on individuals and communities
New Auto-Interp
Head Attr Weights
0:0.05
1:0.05
2:0.02
3:0.04
4:0.03
5:0.42
6:0.02
7:0.01
8:0.05
9:0.11
10:0.11
11:0.04
Negative Logits
Britann
-1.60
kas
-1.49
uador
-1.47
Scotch
-1.45
enta
-1.44
Roses
-1.42
qua
-1.42
Bans
-1.42
Rye
-1.40
ASA
-1.39
POSITIVE LOGITS
seekers
1.89
reperto
1.74
appre
1.69
ivities
1.59
retri
1.57
mess
1.56
exploration
1.55
needy
1.53
flyers
1.51
retrieving
1.49
Activations Density 1.240%