INDEX
Explanations
references to occurrences in different locations or environments
occurrences of the word "in" within various contexts
New Auto-Interp
Negative Logits
accordingly
-0.79
something
-0.76
likewise
-0.74
instead
-0.72
even
-0.71
almost
-0.70
10000
-0.70
even
-0.70
almost
-0.70
whenever
-0.70
POSITIVE LOGITS
sexes
1.09
genders
0.87
sender
0.78
physical
0.75
academia
0.73
BuyableInstoreAndOnline
0.73
literal
0.69
textual
0.68
verbal
0.68
hardware
0.67
Activations Density 0.169%