Martineski

A fellow ADHDer addicted to the platform

Schedule here: lemmy.fmhy.ml/post/301360

Posts
675
Comments
389
Joined
2 yr. ago

IRulely

  • iCar

  • The pig one tho...

  • Just stumbled upon this post while looking for news, and I think it's a great example of why implementing this rule was the right call: https://www.reddit.com/r/singularity/comments/14u6x5p/toyota_claims_battery_breakthrough_with_a_range/

  • Singularity | Artificial Intelligence (ai), Technology & Futurology @lemmy.fmhy.ml
    Martineski @lemmy.fmhy.ml

    Shikra: Unleashing Multimodal LLM's Referential Dialogue Magic (paper from 27.06.2023)

    In human conversations, individuals can indicate relevant regions within a scene while addressing others. In turn, the other person can then respond by referring to specific regions if necessary. This natural referential ability in dialogue remains absent in current Multimodal Large Language Models (MLLMs). To fill this gap, this paper proposes an MLLM called Shikra, which can handle spatial coordinate inputs and outputs in natural language. Its architecture consists of a vision encoder, an alignment layer, and an LLM. It is designed to be straightforward and simple, without the need for extra vocabularies, position encoder, pre-/post-detection modules, or external plug-in models. All inputs and outputs are in natural language form. Referential dialogue is a superset of various vision-language (VL) tasks. Shikra can naturally handle location-related tasks like REC and PointQA, as well as conventional VL tasks such as Image Captioning and VQA. Experimental results showcase Shikra's prom
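
    A purely illustrative, hypothetical example (not Shikra's actual prompt format) of what referential dialogue with coordinates written in natural language can look like:

    ```python
    # Hypothetical illustration only, not Shikra's real I/O format: regions are
    # referred to with normalized [x1, y1, x2, y2] boxes written directly into the
    # natural-language turns, so no extra vocabulary or detection module is needed.
    user_turn = "What is the animal in the region [0.12, 0.30, 0.45, 0.78] doing?"
    model_turn = "The dog [0.12, 0.30, 0.45, 0.78] is chasing a ball [0.50, 0.62, 0.58, 0.70]."
    print(user_turn)
    print(model_turn)
    ```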

    Ooer @lemmy.fmhy.ml
    Martineski @lemmy.fmhy.ml

    u̶p̸l̴o̸a̷d̵ ̵y̴o̸u̶r̶ C̷̪͝O̵̮͝M̸̱̆p̴̧͌u̸̪̕T̸̠̐ě̸͈R̷̮̐

    Singularity | Artificial Intelligence (ai), Technology & Futurology @lemmy.fmhy.ml
    Martineski @lemmy.fmhy.ml

    Andy Jassy dismisses Microsoft and Google A.I. ‘hype cycle’ and says Amazon is starting a ‘substance cycle’ (article from 7.07.2023)

    Amazon CEO Andy Jassy called generative A.I. “one of the biggest technical transformations of our lifetimes” in an interview with CNBC on Thursday. He also called many of today’s A.I. chatbots and other generative A.I. tools part of the “hype cycle,” declaring that Amazon was focused on the “substance cycle.”

    Amazon’s bona fides in the space are well established, having been a player in artificial intelligence and machine learning long before the ChatGPTs and Bards of the world were publicly released. Former Fortune editor Brian Dumaine wrote a book in 2020 about how Amazon founder Jeff Bezos realized early on that imbuing machine learning into every facet of the company would allow it to gather data to constantly improve itself.

    Much as it did with Amazon Web Services, which practically birthed the cloud computing industry that now powers the internet’s biggest companies, including its competitors, Amazon’s A.I. strategy is focused on cementing its position as a major player

    Singularity | Artificial Intelligence (ai), Technology & Futurology @lemmy.fmhy.ml
    Martineski @lemmy.fmhy.ml

    RVT: Robotic View Transformer for 3D Object Manipulation (paper from 26.06.2023)

    For 3D object manipulation, methods that build an explicit 3D representation perform better than those relying only on camera images. But using explicit 3D representations like voxels comes at large computing cost, adversely affecting scalability. In this work, we propose RVT, a multi-view transformer for 3D manipulation that is both scalable and accurate. Some key features of RVT are an attention mechanism to aggregate information across views and re-rendering of the camera input from virtual views around the robot workspace. In simulations, we find that a single RVT model works well across 18 RLBench tasks with 249 task variations, achieving 26% higher relative success than the existing state-of-the-art method (PerAct). It also trains 36X faster than PerAct for achieving the same performance and achieves 2.3X the inference speed of PerAct. Further, RVT can perform a variety of manipulation tasks in the real world with just a few (∼10) demonstrations per task. Visual results, code, a
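
    A minimal sketch (my own illustration under assumed shapes, not RVT's code) of the kind of cross-view aggregation the abstract mentions: per-view image tokens are flattened into one sequence, and a self-attention layer lets every token attend across all views.

    ```python
    import torch
    import torch.nn as nn

    class CrossViewAggregator(nn.Module):
        """Toy cross-view attention: every token can attend to tokens from all views."""
        def __init__(self, d_model=128, n_heads=4):
            super().__init__()
            self.attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)

        def forward(self, view_tokens):
            # view_tokens: (batch, n_views * tokens_per_view, d_model)
            out, _ = self.attn(view_tokens, view_tokens, view_tokens)
            return out

    # Made-up sizes: 5 virtual views with 64 tokens each.
    tokens = torch.randn(2, 5 * 64, 128)
    print(CrossViewAggregator()(tokens).shape)  # torch.Size([2, 320, 128])
    ```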

    Singularity | Artificial Intelligence (ai), Technology & Futurology @lemmy.fmhy.ml
    Martineski @lemmy.fmhy.ml

    How Topical Application of Stem Cell Serum Can Reverse COVID-Induced Hair Loss (article from 30.06.2023)

    Covid-19 is said to cause long-term side effects in up to 67% of patients, and these health consequences can include chronic fatigue, loss of taste and smell and brain fog. Increasingly common too is Covid-related hair loss. Known as telogen effluvium, this phenomenon manifests as clumps of hair falling out after brushing or washing your hair.

    It’s normal to shed hair daily – we lose about 100-150 hairs each day as hair drops from follicles to make way for new hair growth. This growth cycle occurs because 90% of the hair on our heads is in a growth phase (called anagen), while the remaining 10% is in a resting phase (called telogen). Anagen lasts for about three years before transitioning into the shorter telogen phase, following which hair is shed.

    A stressful event like childbirth, certain medications, intense psychological stress and Covid-19 can trigger our bodies to shift a greater-than-normal proportion of growing anagen hairs into a resting telogen state, according to the

    Singularity | Artificial Intelligence (ai), Technology & Futurology @lemmy.fmhy.ml
    Martineski @lemmy.fmhy.ml

    Derivative Free Weight-space Ensembling (paper from 7.07.2023)

    Recent work suggests that interpolating between the weights of two specialized language models can transfer knowledge between tasks in a way that multi-task learning cannot. However, very few have explored interpolation between more than two models, where each has a distinct knowledge base. In this paper, we introduce Derivative Free Weight-space Ensembling (DFWE), a new few-sample task transfer approach for open-domain dialogue. Our framework creates a set of diverse expert language models trained using a predefined set of source tasks. Next, we finetune each of the expert models on the target task, approaching the target task from several distinct knowledge bases. Finally, we linearly interpolate between the model weights using a gradient-free-optimization algorithm, to efficiently find a good interpolation weighting. We demonstrate the effectiveness of the method on FETA-Friends outperforming the standard pretrain-finetune approach.
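
    A minimal sketch of the core interpolation step, assuming PyTorch models with identical architectures; DFWE's gradient-free search over the interpolation weights is omitted, and the example weights below are placeholders.

    ```python
    import torch

    def interpolate_state_dicts(state_dicts, alphas):
        """Linearly combine the parameters of several finetuned expert models."""
        assert abs(sum(alphas) - 1.0) < 1e-6, "interpolation weights should sum to 1"
        merged = {}
        for name in state_dicts[0]:
            merged[name] = sum(a * sd[name].float() for a, sd in zip(alphas, state_dicts))
        return merged

    # Hypothetical usage with three experts finetuned on the target task:
    # merged = interpolate_state_dicts(
    #     [expert_a.state_dict(), expert_b.state_dict(), expert_c.state_dict()],
    #     alphas=[0.5, 0.3, 0.2],  # in DFWE these would come from gradient-free optimization
    # )
    # target_model.load_state_dict(merged)
    ```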

    Singularity | Artificial Intelligence (ai), Technology & Futurology @lemmy.fmhy.ml
    Martineski @lemmy.fmhy.ml

    "Excited to introduce 'GPT-Researcher'!" (Found this in a reddit post from 10.07.2023. Not sure if this is an official announcement; the tool may be older if it's not.)

    The idea is simple - Specify what you want to research, and the AI will autonomously research it for you in minutes!

    ▸ One prompt generates an unbiased, factual and in-depth research report

    ▸ Generate research, outlines, resource and lessons reports

    ▸ Aggregates over 20 web sources per research

    ▸ Includes an easy to use web interface

    ▸ Open source: https://github.com/assafelovic/gpt-researcher

    ▸ Scrapes web sources with JavaScript support

    ▸ Keeps track and context of visited and used web sources

    Singularity | Artificial Intelligence (ai), Technology & Futurology @lemmy.fmhy.ml
    Martineski @lemmy.fmhy.ml

    The second fusion of laser and aerospace — an inspiration for high energy lasers (article from 25.06.2023)

    Abstract:

    Since the first laser was invented, the pursuit of high-energy lasers (HELs) has always been enthusiastic. The first revolution of HELs was pushed by the fusion of laser and aerospace in the 1960s, with the chemical rocket engines giving fresh impetus to the birth of gas flow and chemical lasers, which finally turned megawatt lasers from dream into reality. Nowadays, the development of HELs has entered the age of electricity as well as the rocket engines. The properties of current electric rocket engines are highly consistent with HELs’ goals, including electrical driving, effective heat dissipation, little medium consumption and extremely light weight and size, which inspired a second fusion of laser and aerospace and motivated the exploration for potential HELs. As an exploratory attempt, a new configuration of diode pumped metastable rare gas laser was demonstrated, with the gain generator resembling an electric rocket-engine for improved power scaling ability.

    Singularity | Artificial Intelligence (ai), Technology & Futurology @lemmy.fmhy.ml
    Martineski @lemmy.fmhy.ml

    Dr. Behnaam Aazhang, Ph.D. - Director, Rice Neuroengineering Initiative (NEI), Rice University (video from 6.07.2023)

    Singularity | Artificial Intelligence (ai), Technology & Futurology @lemmy.fmhy.ml
    Martineski @lemmy.fmhy.ml

    Focused Transformer: Contrastive Training for Context Scaling - 256k context length (paper from 6.07.2023)

    Original title: Focused Transformer: Contrastive Training for Context Scaling

    Large language models have an exceptional capability to incorporate new information in a contextual manner. However, the full potential of such an approach is often restrained due to a limitation in the effective context length. One solution to this issue is to endow an attention layer with access to an external memory, which comprises (key, value) pairs. Yet, as the number of documents increases, the proportion of relevant keys to irrelevant ones decreases, leading the model to focus more on the irrelevant keys. We identify a significant challenge, dubbed the distraction issue, where keys linked to different semantic values might overlap, making them hard to distinguish. To tackle this problem, we introduce the Focused Transformer (FoT), a technique that employs a training process inspired by contrastive learning. This novel approach enhances the structure of the (key, value) space, enabling an extensio
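
    A minimal sketch (an illustration under assumed tensor shapes, not the paper's code) of attention over an external memory of (key, value) pairs, the structure that FoT's contrastive training is meant to make easier to search:

    ```python
    import torch
    import torch.nn.functional as F

    def attention_with_memory(q, k_local, v_local, k_mem, v_mem):
        # All tensors: (batch, heads, length, head_dim). Memory keys/values are
        # simply concatenated onto the local context before standard attention.
        k = torch.cat([k_local, k_mem], dim=2)
        v = torch.cat([v_local, v_mem], dim=2)
        scores = q @ k.transpose(-2, -1) / (q.size(-1) ** 0.5)
        return F.softmax(scores, dim=-1) @ v

    # Toy shapes: 4 local tokens plus 16 retrieved memory entries.
    q = torch.randn(1, 2, 4, 32)
    out = attention_with_memory(q,
                                torch.randn(1, 2, 4, 32), torch.randn(1, 2, 4, 32),
                                torch.randn(1, 2, 16, 32), torch.randn(1, 2, 16, 32))
    print(out.shape)  # torch.Size([1, 2, 4, 32])
    ```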

    Singularity | Artificial Intelligence (ai), Technology & Futurology @lemmy.fmhy.ml
    Martineski @lemmy.fmhy.ml

    Inside Google’s big AI shuffle — and how it plans to stay competitive, with Google DeepMind CEO Demis Hassabis (article from 10.07.2023)

    hmmmGIFs @lemmy.fmhy.ml
    Martineski @lemmy.fmhy.ml

    hmmm

    hmmm @lemmy.fmhy.ml
    Martineski @lemmy.fmhy.ml

    hmmm

    hmmm @lemmy.fmhy.ml
    Martineski @lemmy.fmhy.ml

    hmmm

    hmmm @lemmy.fmhy.ml
    Martineski @lemmy.fmhy.ml

    hmmm

    Ooer @lemmy.fmhy.ml
    Martineski @lemmy.fmhy.ml

    Lift bar to create a M̸̳̅Ḁ̴̒S̴̥͌S̶͓̅ ̴̹́É̶͔X̸̮̆T̷̢͝I̶̛͓N̸̢̎C̸̩͋Ṱ̶̂I̴͔̽O̸̠͂N̷̜̈́

    Ooer @lemmy.fmhy.ml
    Martineski @lemmy.fmhy.ml

    Baby

    Test

  • There are so many test posts lately.

  • It's not a commonly seen rule; I do it for transparency reasons. I will make a post about the rules and pin it, but first I need to rewrite some of them and find time for that while I'm really busy. :x

  • Add the date of the article to the title as per our rule 6. Copy this:

    (article from 19.07.2023)

    Thank you.

  • Permanently Deleted

  • You mean my post

  • hmmm @lemmy.fmhy.ml
    Martineski @lemmy.fmhy.ml

    A new sublemmy from the "hmmm" category was just started! Check out hmmmTexts and subscribe to it if you like the content!

    Link to the sublemmy: [email protected]

    The other sublemmy from the hmmm category that I didn't make an announcement for: [email protected]

  • You're right, seems like it:

    Currently, Claude 2 API is available to businesses only. Additionally, to gain access, you need to send a request to the Anthropic team.

    However, if you live anywhere except the US or UK, you cannot use Claude 2.

    Source: https://thenaturehero.com/claude-2-api/ ("Claude 2 API – Everything You Need To Know")

  • We are pleased to announce Claude 2, our new model. Claude 2 has improved performance, longer responses, and can be accessed via API as well as a new public-facing beta website, claude.ai.

  • Singularity | Artificial Intelligence (ai), Technology & Futurology @lemmy.fmhy.ml
    Martineski @lemmy.fmhy.ml

    GPT-4 details leaked (Leak from ~10.07.2023)

    I just copy/pasted what's in the link so formatting may be broken:

    GPT-4's details are leaked.

    It is over.

    Everything is here: twitter.com/i/web/status/1…

    Parameters count:

    GPT-4 is more than 10x the size of GPT-3. We believe it has a total of ~1.8 trillion parameters across 120 layers. Mixture Of Experts - Confirmed.

    OpenAI was able to keep costs reasonable by utilizing a mixture of experts (MoE) model. They utilize 16 experts within their model, each about ~111B parameters for the MLP. 2 of these experts are routed to per forward pass.

    MoE Routing:

    While the literature talks a lot about advanced routing algorithms for choosing which experts to route each token to, OpenAI's is allegedly quite simple, for the current GPT-4 model.

    There are roughly ~55B shared parameters for attention.

    Inference:

    Each forward pass inference (generation of 1 token) only utilizes ~280B parameters and ~560 TFLOPs. This contrasts with the ~1.8 trillion parameters and ~3,700 TFLOP that
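
    For anyone unfamiliar with MoE layers, a minimal sketch of top-2 routing of the kind described in the leak (16 experts, 2 active per token). The sizes are tiny placeholders, not the ~111B-parameter MLPs claimed above, and this is not OpenAI's implementation.

    ```python
    import torch
    import torch.nn as nn
    import torch.nn.functional as F

    class Top2MoE(nn.Module):
        """Toy mixture-of-experts MLP: a router picks 2 of n_experts per token."""
        def __init__(self, d_model=64, n_experts=16):
            super().__init__()
            self.router = nn.Linear(d_model, n_experts)
            self.experts = nn.ModuleList(
                nn.Sequential(nn.Linear(d_model, 4 * d_model), nn.GELU(),
                              nn.Linear(4 * d_model, d_model))
                for _ in range(n_experts))

        def forward(self, x):                      # x: (tokens, d_model)
            gate = F.softmax(self.router(x), dim=-1)
            top_w, top_idx = gate.topk(2, dim=-1)  # choose 2 experts per token
            top_w = top_w / top_w.sum(dim=-1, keepdim=True)
            out = torch.zeros_like(x)
            for slot in range(2):                  # only the chosen experts run
                for e, expert in enumerate(self.experts):
                    mask = top_idx[:, slot] == e
                    if mask.any():
                        out[mask] += top_w[mask, slot].unsqueeze(-1) * expert(x[mask])
            return out

    print(Top2MoE()(torch.randn(8, 64)).shape)  # torch.Size([8, 64])
    ```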

    Singularity | Artificial Intelligence (ai), Technology & Futurology @lemmy.fmhy.ml
    Martineski @lemmy.fmhy.ml

    A.I. Health scans are going to become the Norm (article from 11.07.2023)

  • Please add the date of the source to the title as per our rule 6. Copy this:

    (paper from 18.07.2023)

    Thank you.

  • hmmm

  • And he damn does!

  • Naw, this platform is still very buggy so I doubt that it's your fault.

  • The "name" is the permanent name of the sublemmy, and it can only contain lowercase characters with no spaces. The "displayed name" is the one you want to have capitalised characters and other stuff in.

  • Wow, that's crazy weird.