This project is a speech recognition based text editor with Multiple language including Indian language and also various functionality like Paraphrasing, Audio or video recordings to text, translator

Overview

Speechnotes

Speechnotes is Speech Recognition based text Editor where we type the sentence througn our voice with multiple languages including our Indian Languages like Hindi,Marathi,tamil etc. With that it also transcribes the audio and video recordings to text and also it is **** multilingual***. This Application also comes with Multiple features like Speech Recognition based Translator, Paraphrasing, Spell Checker.

Table of Contents

• Installation • Dependencies • Use cases • Sneak peeks • Authors

Installation

You would need mysql and python. Clone this project and run it as any python project and than from the root of the repoistry run pip install -r requirements.txt

Dependencies

⮚ Speech to text & Text to Speech with various languages including Indian Languages.

⮚ Audio or Video Recordings to text and can be save in pdf format.

⮚ Text editor with various editing tools & Spell Checker.

⮚ Paraphrasing to rephrase the sentences without changing its meaning.

⮚ Plagiarism to evaluate various research articles in general.

⮚ Translator with speech to text for regional languages & with multiple inputs to translate English to Hindi or vice versa.

⮚ Effective accuracy

Use Cases

⮚ Useful for students and teachers for preparing notes from online recordings.

⮚ Useful for person having problem like Dyslexia.

⮚ Useful for research Scholars, Professor, scientists & for various Working professionals.

⮚ Useful for Security

sneak Peek

Screenshot 2022-03-25 165507 Screenshot 2022-04-10 103004 Screenshot 2022-03-25 165651 Screenshot 2022-04-10 101718

Authors

• Suraj Singh @SinghSuraj-04092002

You might also like...

UNetFormer: A UNet-like transformer for efficient semantic segmentation of remote sensing urban scene imagery, ISPRS. Also, including other vision transformers and CNNs for satellite, aerial image and UAV image segmentation.

UNetFormer: A UNet-like transformer for efficient semantic segmentation of remote sensing urban scene imagery, ISPRS. Also, including other vision transformers and CNNs for satellite, aerial image and UAV image segmentation.

Version 2.0 (stable) Welcome to my homepage! News UNetFormer (accepted by ISPRS, Research Gate) and UAVid dataset are supported. ISPRS Vaihingen and P

Sep 26, 2022

This python3 code will add intro + outro + logo to your video and then will upload your edited video to YouTube, including all video details by a single click.

Python3 Video Editor + YouTube API Uploader This Python3 code allows you to add intro + outro + logo + upload complete video, including all video deta

Sep 7, 2022

Final Year Project: Surface Electromyography (sEMG) Silent Speech - Automatic Speech Recognition (ASR)

sEMG Silent Speech - Automatic Speech Recognition (ASR) About This module is produced as part of the requirement for my Final Year Project for my BSc

Jun 13, 2022

Useful functionality to interact with Maya, including OOP nodes

MayaX Table of Contents Introduction Installation Usage API Node Properties Node methods Attribute Properties Attribute methods Misc Introduction Maya

Sep 24, 2022

Built a GUI Based Face Recognition Module. Functionality that allows people to register their faces along with their name and enrollment number to make a paperless and smooth attendance system.

Built a GUI Based Face Recognition Module. Functionality that allows people to register their faces along with their name and enrollment number to make a paperless and smooth attendance system.

Face_recognition_based_attendance_system A python GUI integrated attendance system using face recognition to take attendance. In this python project,

Aug 17, 2022

A file encryption software which can encrypt and decrypt various file formats such as pdfs, csv, image, audio, video

This is a file encryption software which can encrypt and decrypt various file formats such as pdfs, csv, image, audio, video, etc. It sends the encrypted file through email and the key encrypted with RSA through SMS.

Apr 23, 2022

Text to Emoji ✨- A text to emoji translator built using Tensorflow, NLTK in the backend

A text to emoji translator built using Tensorflow, NLTK in the backend. The UI is brought together with React and TailWind.

Jun 30, 2022

Uses Google cloud's Text-to-Speech client libraries to convert text to downloadable audio

text-to-speech-webpage Uses Google cloud's Text-to-Speech client libraries to convert text to downloadable audio Steps to run the app locally Create a

Sep 20, 2022
Owner
Suraj Singh
NAmasTE Everyone Myself Suraj Singh a developer engineer student.
Suraj Singh
This project is done using Tesseract OCR in python. Tesseract is an optical character recognition (OCR) engine for various operating systems. Using this project one can do various text recognition operations

Text-Recognition-OCR This project is done using Tesseract OCR in python. Tesseract is an optical character recognition (OCR) engine for various operat

null 1 Jul 5, 2022
Facestar dataset. High quality audio-visual recordings of human conversational speech.

Facestar Dataset Description Existing audio-visual dataset for human speech are either captured in a clean, controlled environment but contain only a

Meta Research 81 Sep 12, 2022
SEPIA Speech-To-Text (STT) Server is a WebSocket based, full-duplex Python server for realtime automatic speech recognition (ASR) supporting multiple open-source ASR engines

SEPIA Speech-To-Text Server SEPIA Speech-To-Text (STT) Server is a WebSocket based, full-duplex Python server for realtime automatic speech recognitio

SEPIA 76 Sep 21, 2022
Auto-Editor is a command line application for automatically editing video and audio by analyzing a variety of methods, most notably audio loudness.

Auto-Editor is a command line application for automatically editing video and audio by analyzing a variety of methods, most notably audio loudness. Be

WyattBlue 1.2k Sep 24, 2022
[SIGIR 2022] CenterCLIP: Token Clustering for Efficient Text-Video Retrieval. Also, a text-video retrieval toolbox based on CLIP + fast pyav video decoding.

CenterCLIP CenterCLIP achieves state-of-the-art text-video retrieval performance and decent computation cost reduction on MSVD, MSRVTT, LSMDC, and Act

Shuai Zhao 64 Aug 19, 2022
Timething is a library for aligning text transcripts with their audio recordings.

Timething Timething is a library for aligning text transcripts with audio. You provide an audio file, as well as a text file with the complete text tr

FELD Berlin 14 Aug 10, 2022
A mulit language translator and able to generate audio from text generated.

NLP-Multi-language-translator An app for translating from one language to another.Almost all world languages are available. Features. Translates to an

Mugume Ian 4 Mar 24, 2022
I Created this library to parse indian address as accurate as possible and this library is opensource so you can contribute to it also.

Indian Address Parser In python I Created this library to parse indian address as accurate as possible and this library is opensource so you can contr

Shrikant Dhayje 1 Sep 22, 2022
✍️ Korean Paraphrasing Tool Using Round-trip Translation

KoQuillBot: Korean Paraphrasing Tool Implemented with Korean ↔️ English round trip translation. Description ?? Huggingface Link Github Repo Korean ➡️

null 6 Aug 2, 2022
This is a terminal python project to learn the basics of os.sys calls and also work with other apps like mpv and packages like youtube dl and youtube search.

Terminal_Music_Player_python_project This is a terminal python project to learn the basics of os.sys calls and also work with other apps like mpv and

null 1 Mar 27, 2022