Explanations
phrases related to groups or teams
phrases that refer to groups of people or teams
Negative Logits
lly         -0.73
cape        -0.73
uations     -0.71
initions    -0.70
let         -0.69
phase       -0.69
uer         -0.68
runtime     -0.67
catentry    -0.67
tesy        -0.67
Positive Logits
explorers      0.98
adventurers    0.95
volunteers     0.94
advisors       0.92
advisers       0.91
strangers      0.90
scholars       0.89
thinkers       0.89
sorts          0.88
robbers        0.86
Activations Density: 0.128%