If you want to know how to use Mellonn Speak from start to finish, from uploading an audio file, to editing the results, to exporting a DOCX file, then you've come to the right place!
Use a file from a voice recording app
If you've used your phone's recording app, you can select the recording inside that app and press the share button. There you should see Mellonn Speak as an option. How the sharing process works can vary from phone to phone.
The rest of the process is the same as described below, except that the audio file has already been chosen.
Start a transcription job
Starting a transcription job is the most important part of the app, and it is meant to be as intuitive as possible. If you still have trouble with it and can't find your answers here, please don't hesitate to report an issue. Once you have access to the audio file you want to use through the Archives (or Files) app, you can start a transcription job.
- On the Record page, press the green button saying "Upload recording".
- Press the green button saying "Select Audio File", and a file selector will pop up. Choose the audio file you want to use.
- Enter a title and description, then choose the number of participants and the language spoken.
- When you are ready, press the green button saying "Next" to proceed to checkout.
- Here you will be presented with information on your order. If everything looks right, press the green button saying "Pay" (some versions say "Pay" even when the job is free; this has been fixed).
- Your recording should now be uploaded, and you can see the job status on the Recordings page. When it's done transcribing, there will be a checkmark next to the title.
- Note: you may need to reload the page to update the status.
Read the transcription
When the transcription job is done, there will be a checkmark next to the title on the Recordings page. You can then press the recording to open it. Downloading the result will take some time, depending on the length of the recording.
- If this is the first time you open the transcription, you will be prompted to label the participants. You can listen to short clips from the recording to make sure your labels are correct. The labels can also be changed later.
- You will now be presented with a chat-inspired page, where you can read the transcription.
- Underneath each chat bubble is a description showing the speaker and the time interval of the recording it covers.
- You can either press a chat bubble or pull it out to get the options to play the recording for that time frame or to edit the text in that bubble.
- In the orange box with the title, there is a three-dot menu with the following options:
  - "Edit labels": edit the labels associated with the participants.
  - "Edit": listen to the whole recording while changing who is speaking at any given time.
  - "Download DOCX": save the transcription as a DOCX (Word) file on your device.
  - "Info": shows info about your recording, including title, description, file name and number of speakers (participants).
  - "Delete this recording": deletes the recording. This CANNOT be undone.
Edit the speaker labels
When you press the "Edit" button in the three-dot menu, you will be presented with a page for choosing who is speaking when. While listening to the recording, you can change who is speaking, either by correcting the speaker the AI has assigned or by assigning the speakers yourself.