• Some modern heros of DeepSeek (Re: the asteroid that kills tech dinosau

    From Mild Shock@21:1/5 to Mild Shock on Fri Jan 31 23:58:26 2025
    Hi,

    Please meet Luo Fuli:

    The 29-Year-Old Genius Behind DeepSeek’s AI Revolution https://www.youtube.com/watch?v=B2fxh4aoQ8Q

    I find this paper interesting, finally
    some say about fine tuning during pretraing:

    Raise a Child in Large Language Model
    13 Sep 2021 - Fuli Luo et al.
    https://arxiv.org/pdf/2109.05687

    Bye

    Mild Shock schrieb:
    Hi,

    So how its going? DeepSeek embraced by many cloud
    providers, even by NVIDIA NIM itself.

    DeepSeek-R1 Now Live With NVIDIA NIM https://blogs.nvidia.com/blog/deepseek-r1-nim-microservice/

    What what are these models doing and how are they
    trained. Is Geoffrey Hinton our only AI God? There
    seems to be another slightly disputed AI God,

    S. Hochreiter, J. Schmidhuber. Long Short-Term Memory. Neural
    Computation, 9(8):1735-1780, 1997. https://people.idsia.ch/~juergen/deep-learning-history.html

    Bye

    P.S.: It allows a mechanistic view on our linguistic
    brain if the latent space is some semantic vectors?
    So that learning is a kind of control mechanism:

    Machine Learning Approach to Model Order Reduction
    of Nonlinear Systems via Autoencoder and LSTM Networks
    Thomas Simpson - 23 Sep 2021
    https://arxiv.org/abs/2109.11213

    Mild Shock schrieb:
    Hi,

    Wait till USA figures out there is a second
    competitor besides DeepSeek, its called Yi-Lightning:

    Yi-Lightning Technical Report
    https://arxiv.org/abs/2412.01253

    It was already discussed 2 months ago:

    Eric Schmidt DROPS BOMBSHELL: China DOMINATES AI!
    https://www.youtube.com/watch?v=ddWuEUjo4u4

    Bye


    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)