INDEX
Explanations
mentions of interactions with a specific group of people, such as colleagues or associates
references to groups of people, particularly those labeled as "fellow."
New Auto-Interp
Negative Logits
uilt
-0.69
livest
-0.62
_-
-0.61
Sue
-0.60
creen
-0.60
ussy
-0.60
âĵĺ
-0.59
anwhile
-0.59
onics
-0.59
itton
-0.59
POSITIVE LOGITS
traveler
1.22
travelers
1.17
travellers
1.06
strugg
1.05
worldly
1.01
traveller
0.95
classmates
0.92
workers
0.79
worker
0.79
inmate
0.78
Activations Density 0.071%