INDEX
Explanations
phrases related to physical locations or people's names
words related to a state of being known as "being scared" or "fear."
New Auto-Interp
Negative Logits
Mit
-0.63
OD
-0.62
âĶĢâĶĢ
-0.59
rooting
-0.57
Crew
-0.57
Bulldogs
-0.56
YS
-0.56
Pavilion
-0.56
INS
-0.56
annels
-0.56
POSITIVE LOGITS
paren
1.02
thro
0.97
bryce
0.85
throp
0.81
atche
0.77
baugh
0.77
ndra
0.76
eer
0.75
tsky
0.74
ner
0.72
Activations Density 0.009%