Hacker Newsnew | past | comments | ask | show | jobs | submitlogin
Ask HN: What do you think is coming next for generative AI?
5 points by rexbee on Dec 3, 2022 | hide | past | favorite | 4 comments
It seems GPT3 suggests the best token/word given the previous words.

Will it be possible to, given a large enough dataset of MP3 files, predict the next millisecond of audio based on previous milliseconds of audio and generate songs? Will we generate videos by predicting the next best frame?

Is there any technical reason we couldn't collect first person audio and video data with the cameras and microphone on a Quest Pro and generate how the next few minutes of our life could look?



> predict the next millisecond of audio based on previous milliseconds of audio

Not milliseconds, but AudioLM [1] already does it with just seconds, for speech (and piano). Results are already very convincing (to me).

[1] https://google-research.github.io/seanet/audiolm/examples/


yes but I think she's talking about something more like real-time, generating new output as you go through with the input (maybe like slicing windows from a stats. perspective)


"Signatures" on videos to prove that, yes, they are "authentic" and not AI-generated. I have no idea how it'd be enforced though.


Ah, you mean like in Devs?




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: