The paper examines the temporal voice onset time (VOT) duration of consonantal stops between the release burst of stops and the onset of vibration of the voice box. This is an acoustic study of Voice Onset Time of L1 Sindhi and L2 English stops. It aims to determine the acoustic average values of Voice Onset Time (VOT) in Sindhi English stop consonants i.e. L1 and L2 production of speech. According to the author, this descriptive study Voice Onset Time (VOT) in Sindhi-English stop consonants has been conducted first time on English-Sindhi VOT (ms) stops. The study recruited 10 (5 females and 5 male participants), who were Sindhi native speakers and English as a Second Language (ESL) Learners. The stimuli were designed in which three voicing pairs were selected from both Sindhi and English. The VOT (ms) of six Sindhi and English voicing stop pairs /p/, /t/, /k/ & /b/, /d/, and /ɡ/ were measured i.e. one voiceless and one voiced. The undergraduate students recorded their voice samples in both Sindhi and English. The data were separately analyzed from Sindhi and English languages. There were acoustic differences within the groups and between the groups in VOT (ms) of English and Sindhi. The statistical tests were run on the results where the significance level of p-value was fixed on < 0.05. The results illustrate that there is a significant difference between voiceless sounds i.e. /p/ & /k/ means. The findings of the data reveal that there is no statistically significant difference between VOT (ms) means of male & female for the voiceless & voiced consonantal sounds i.e., /t/, /b/, /d/ & /ɡ/. The data further discovered that English-Sindhi VOT characteristics of stops are not associated with gender; however, English-Sindhi VOT altered as a function of the place of articulation.