INDEX
Explanations
words related to political, cultural, and historical terms specific to certain regions
references to a specific entity or concept, particularly one that is repeated frequently and relevant in the context
Negative Logits
depress      -0.64
Philippines  -0.64
lead         -0.63
bacon        -0.61
kill         -0.60
Firefox      -0.58
sensitive    -0.57
flight       -0.57
Johns        -0.57
Chester      -0.56
Positive Logits
ti           4.53
tu           1.77
tis          1.73
ta           1.60
Ti           1.58
TI           1.45
si           1.39
tin          1.34
shi          1.32
t            1.26
Activations Density: 0.011%