INDEX
Explanations
mentions of performing activities at home
references to the concept of home and personal space
New Auto-Interp
Negative Logits
resil
-0.72
Osw
-0.65
insula
-0.65
Rebell
-0.64
manac
-0.63
ggles
-0.61
icent
-0.61
itialized
-0.59
osi
-0.58
Tate
-0.58
POSITIVE LOGITS
aneously
0.99
without
0.89
instead
0.85
rather
0.85
using
0.80
whenever
0.78
WITHOUT
0.78
via
0.77
whilst
0.75
anywhere
0.75
Activations Density 0.271%