知鸦 Daily 2024-10-12

2024-10-11 16:30:00 ~ 2024-10-12 16:30:00

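Technology

58同城 Tech: Putting the popular front-end framework Astro into practice in the real-estate business

Summary

Astro's advantages include being lightweight and high-performance, excellent SEO, compatibility and flexibility, a simple development experience, strong community support, and cost-effectiveness.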


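The three matchmaking algorithms and mathematical models behind 王者荣耀 (Honor of Kings)

Summary

Esports is so compelling not only because it is exciting and competitive, but also because of the precise mathematical matchmaking mechanisms behind it.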

Grab Tech: Leveraging RAG-powered LLMs for Analytical Tasks

Summary

Retrieval-Augmented Generation (RAG) is a powerful process that is designed to integrate direct function calling to answer queries more efficiently by retrieving relevant information from a broad database. In the rapidly evolving business landscape, Data Analysts (DAs) are struggling with the growing number of data queries from stakeholders. The conventional method of manually writing and running similar queries repeatedly is time-consuming and inefficient. This is where RAG-powered Large Language Models (LLMs) step in, offering a transformative solution to streamline the analytics process and empower DAs to focus on higher value tasks.

In this article, we will share how the Integrity Analytics team has built out a data solution using LLMs to help automate tedious analytical tasks like generating regular metric reports and performing fraud investigations.


Uber Tech: Genie: Uber’s Gen AI On-Call Copilot

Summary

In today’s fast-paced tech environment, maintaining robust on-call operations is crucial for ensuring seamless service functioning. Modern platform engineering teams face the challenge of efficiently managing on-call schedules, incident response, communication during critical moments, and strong customer support on Slack® channels.

This post describes Genie, an on-call copilot we built that uses generative AI to optimize communication and question-answering with on-call engineers.


Slack Tech: We’re All Just Looking for Connection

Summary

We’ve been working to bring components of Quip’s technology into Slack with the canvas feature, while also maintaining the stand-alone Quip product. Quip’s backend, which powers both Quip and canvas, is written in Python. This is the story of a tricky bug we encountered last July and the lessons we learned along the way about being careful with TCP state. We hope that showing you how we tackled our bug helps you avoid — or find — similar bugs in the future!


Pinterest Tech: Ray Batch Inference at Pinterest (Part 3)

Summary

Offline batch inference involves operating over a large dataset and passing the data in batches to an ML model, which generates a result for each batch. Offline batch inference jobs generally consist of a series of steps: data loading, preprocessing, inference, postprocessing, and result writing. These jobs can be both I/O and compute intensive.
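
The five steps named in this summary can be sketched in plain Python. This is a minimal illustration only, not Pinterest's implementation, and it does not use Ray. Every name in it (`load_rows`, `preprocess`, `toy_model`, `postprocess`, `run_batch_inference`) is hypothetical, the "model" is a stand-in arithmetic function, and an in-memory list stands in for result writing.

```python
from typing import Iterable, Iterator, List


def batched(rows: Iterable[dict], batch_size: int) -> Iterator[List[dict]]:
    """Group an iterable of rows into lists of at most batch_size rows."""
    batch: List[dict] = []
    for row in rows:
        batch.append(row)
        if len(batch) == batch_size:
            yield batch
            batch = []
    if batch:  # emit the final, possibly smaller, batch
        yield batch


def load_rows() -> Iterator[dict]:
    # Step 1, data loading (hypothetical): a real job would read from storage.
    for i in range(10):
        yield {"id": i, "text": f"item {i}"}


def preprocess(batch: List[dict]) -> List[int]:
    # Step 2, preprocessing (hypothetical): text length is the only feature.
    return [len(row["text"]) for row in batch]


def toy_model(features: List[int]) -> List[float]:
    # Step 3, inference: stand-in for an ML model, one score per input.
    return [f / 10.0 for f in features]


def postprocess(batch: List[dict], scores: List[float]) -> List[dict]:
    # Step 4, postprocessing: attach each score to the id of its input row.
    return [{"id": row["id"], "score": s} for row, s in zip(batch, scores)]


def run_batch_inference(batch_size: int = 4) -> List[dict]:
    sink: List[dict] = []  # Step 5, result writing: a list stands in for storage.
    for batch in batched(load_rows(), batch_size):
        features = preprocess(batch)
        scores = toy_model(features)
        sink.extend(postprocess(batch, scores))
    return sink


if __name__ == "__main__":
    for record in run_batch_inference():
        print(record)
```

In this sketch the steps run one after another for each batch. Because loading and writing are I/O bound while inference is compute bound, a production system would typically overlap them rather than run them serially.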


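Soul Tech: Android startup optimization: exploring and implementing an earlier ad-loading flow

Summary

Before laying out the background, here is one experimental result: validated on a gray-release build in production, moving the ad flow earlier shortens average startup time by about 300 ms. The rest of the article explains why we needed to do this.

Startup optimization is a well-worn topic, and Soul App has kept working on it. We have explored and shipped both conventional and "black magic" approaches. One pain point, however, has been hard to get past: in the core startup flow, a business-specific requirement means ad loading can begin only after Application has finished executing, and this serial execution noticeably hurts the startup experience.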


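小红书 Tech: 小红书 proposes HASS, an LLM inference acceleration algorithm that sets a new SOTA

Summary

HASS focuses on the discrepancies between draft-model training and decoding, and strengthens the alignment between the two in both objective and context.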


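携程 Tech: Moving 携程 international flight base data onto a data middle platform: building an efficient data management and application platform

Summary

Server costs reduced by 95%; efficiency of onboarding new data sources improved by 90%.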


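360 Tech: 360 Intelligent Computing Center: putting a ten-thousand-GPU cluster into practice

Summary

Within 360, the core requirements for the intelligent computing center are performance and stability. This article looks in depth at how the center put a ten-thousand-GPU cluster into practice, covering compute infrastructure build-out, cluster optimization, construction of the AI development platform, and training and inference acceleration.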


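Methods

A high-quality method for retrospectives (复盘): a full 7,000-character breakdown!

Summary

In business operations and management, the retrospective is an important part of putting strategy into practice; used well, it is a great help to both the organization and the individual.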
