Please visit the homepage for location and information on open hours |
Thread Rating:
What projects are you working on
|
03-12-2022, 12:08 PM,
(This post was last modified: 03-12-2022, 12:10 PM by ABearden.)
|
|||
|
|||
RE: What projects are you working on
(03-10-2022, 05:51 PM)Claud Wrote: I'm working on a project that would download mp3's from an RSS feed and transcribe the audio from those mp3's into a large JSON file. Or you can take a YouTube channel url and download all the vtt files that YouTube already has and convert them to text files. I'm curious how the JSON is connected to database indexing. I would think raw data would be more efficient, but I'm not heavily involved in Postgre. In either case, VTT files are already text files. Also, the conversion to JSON entirely depends on how you will be using it. For example, if you have a VTT file that could convert to something like this with extra metadata: Code: { For a search engine, how you store the data will be heavily influenced by the algorithm you're using. My experience with writing search engines is a bit old, but my first instinct would be to separate the individual captions into their own table and precompile individual word scores that are well indexed. |
|||
« Next Oldest | Next Newest »
|
Users browsing this thread: 2 Guest(s)