INDEX
Explanations
terms related to significant problems or obstacles
references to challenges and difficulties faced by society
New Auto-Interp
Negative Logits
ãĤ¦ãĤ¹
-0.77
ossip
-0.75
vomit
-0.73
curls
-0.70
ensional
-0.70
Rules
-0.69
vez
-0.68
Bonus
-0.66
tein
-0.66
morph
-0.65
POSITIVE LOGITS
facing
1.43
confronting
1.31
faced
1.24
posed
1.11
plag
1.05
humankind
0.94
looming
0.93
confronted
0.91
ahead
0.91
confronts
0.90
Activations Density 0.222%