INDEX
Explanations
references to personal physical experiences and sensations
New Auto-Interp
Negative Logits
winters
-0.17
Valentine
-0.17
ieur
-0.16
snowy
-0.15
åĨ¬
-0.14
snow
-0.14
lần
-0.14
hangi
-0.14
Injector
-0.14
egrity
-0.14
POSITIVE LOGITS
heat
0.36
Heat
0.32
cooling
0.30
summer
0.29
Heat
0.28
cool
0.28
Summer
0.28
overhe
0.27
heat
0.26
cool
0.26
Activations Density 0.144%