RError.com

RError.com Logo RError.com Logo

RError.com Navigation

  • 主页

Mobile menu

Close
  • 主页
  • 系统&网络
    • 热门问题
    • 最新问题
    • 标签
  • Ubuntu
    • 热门问题
    • 最新问题
    • 标签
  • 帮助
主页 / 问题 / 1160837
Accepted
Yuriy Tigiev
Yuriy Tigiev
Asked:2020-08-01 13:57:47 +0000 UTC2020-08-01 13:57:47 +0000 UTC 2020-08-01 13:57:47 +0000 UTC

大表的左连接查询优化 (mssql)

  • 772

如何优化以下查询?表中的记录数 > 1400 万。

此处提供了 7 月 1 日至 3 日期间“XVG/BTC”、“XRP/BTC”、“XMR/BTC”的初始数据示例

CREATE VIEW [dbo].[v_agg_day_trades] WITH SCHEMABINDING
AS
SELECT 
        LEFT(a.pair, CHARINDEX('/', a.pair)-1) Base,
        SUBSTRING(a.pair, CHARINDEX('/', a.pair)+1, 20) Quote,      
        a.pair as Pair,
        CAST(a.dt AS DATE) [date], 
        ROUND(AVG(100 * (CASE WHEN a.price > b.price THEN a.price / b.price - 1 WHEN a.price < b.price THEN b.price / a.price - 1 ELSE null END )), 8) PriceStep, 
        ROUND(MIN(a.price), 8) MinPrice,
        ROUND(MAX(a.price), 8) MaxPrice,
        ROUND(AVG(a.price), 8) AvgPrice,
        SUM(a.amount) BaseDayVolume,
        AVG(a.amount) BaseAvgVolume,
        MIN(a.amount) BaseMinVolume,
        MAX(a.amount) BaseMaxVolume,
        SUM(a.cost) QuoteDayVolume,
        AVG(a.cost) QuoteAvgVolume,
        MIN(a.cost) QuoteMinVolume,
        MAX(a.cost) QuoteMaxVolume,
        COUNT(*) Transactions
FROM dbo.trades a WITH(NOLOCK, NOWAIT) LEFT HASH JOIN dbo.trades b WITH(NOLOCK, NOWAIT) ON a.pair = b.pair AND a.tid + 1 = b.tid  AND a.side != b.side
GROUP BY 
        LEFT(a.pair, CHARINDEX('/', a.pair)-1),
        SUBSTRING(a.pair, CHARINDEX('/', a.pair)+1, 20),        
        a.pair, 
        cast(a.dt as date)

GO

CREATE TABLE [dbo].[trades](
    [id] [int] IDENTITY(1,1) NOT NULL,
    [pair] [nvarchar](20) NOT NULL,
    [dt] [datetime2](7) NOT NULL,
    [ts] [bigint] NOT NULL,
    [tid] [int] NOT NULL,
    [side] [nvarchar](20) NOT NULL,
    [price] [float] NULL,
    [amount] [float] NULL,
    [cost] [float] NULL,
    [fee] [float] NULL
) ON [PRIMARY]
GO


USE [tb5]
GO

SET ANSI_PADDING ON
GO

/****** Object:  Index [ClusteredIndex-20200729-004955]    Script Date: 8/1/2020 09:54:42 ******/
CREATE CLUSTERED INDEX [ClusteredIndex-20200729-004955] ON [dbo].[trades]
(
    [pair] ASC,
    [ts] ASC,
    [tid] ASC
)WITH (PAD_INDEX = OFF, STATISTICS_NORECOMPUTE = OFF, SORT_IN_TEMPDB = OFF, DROP_EXISTING = OFF, ONLINE = OFF, ALLOW_ROW_LOCKS = ON, ALLOW_PAGE_LOCKS = ON, OPTIMIZE_FOR_SEQUENTIAL_KEY = OFF) ON [PRIMARY]
GO

在此处输入图像描述

sql-server
  • 1 1 个回答
  • 10 Views

1 个回答

  • Voted
  1. Best Answer
    Yuriy Tigiev
    2020-08-02T02:08:53Z2020-08-02T02:08:53Z

    更新了代码以反映 @Mike 的建议。

    CREATE VIEW [dbo].[v_agg_day_trades] WITH SCHEMABINDING
    AS
    WITH q  AS (
    
        SELECT LEFT(pair, CHARINDEX('/', pair)-1) Base, SUBSTRING(pair, CHARINDEX('/', pair)+1, 20) Quote, pair, CAST(dt AS DATE) [date], 
            price,  
            LEAD(price,1) OVER(PARTITION BY pair ORDER BY tid) next_price, 
            side,
            LEAD(side,1) OVER(PARTITION BY pair ORDER BY tid) next_side, amount, cost
        FROM dbo.trades WITH(NOLOCK, NOWAIT)
    ) 
    SELECT  Base, Quote, pair, [date],
            ROUND(MIN(q.price), 8) MinPrice,
            ROUND(MAX(q.price), 8) MaxPrice,
            ROUND(AVG(q.price), 8) AvgPrice,
            ROUND(AVG(CASE WHEN NOT (side = next_side OR price = next_price) THEN ROUND(100*ABS(price/next_price - 1),4) ELSE NULL END), 4) priceStep, 
            SUM(q.amount) BaseDayVolume,
            AVG(q.amount) BaseAvgVolume,
            MIN(q.amount) BaseMinVolume,
            MAX(q.amount) BaseMaxVolume,
            SUM(q.cost) QuoteDayVolume,
            AVG(q.cost) QuoteAvgVolume,
            MIN(q.cost) QuoteMinVolume,
            MAX(q.cost) QuoteMaxVolume
    FROM q
    GROUP BY Base, Quote, pair, [date] 
    

    /****** Object:  Index [NonClusteredIndex-20200801-213918]    Script Date: 8/1/2020 21:56:38 ******/
    CREATE NONCLUSTERED INDEX [NonClusteredIndex-20200801-213918] ON [dbo].[trades]
    (
        [pair] ASC,
        [tid] ASC
    )
    INCLUDE([side],[price],[dt],[ts],[amount],[cost],[fee]) WITH (PAD_INDEX = OFF, STATISTICS_NORECOMPUTE = OFF, SORT_IN_TEMPDB = OFF, DROP_EXISTING = OFF, ONLINE = OFF, ALLOW_ROW_LOCKS = ON, ALLOW_PAGE_LOCKS = ON, OPTIMIZE_FOR_SEQUENTIAL_KEY = OFF) ON [PRIMARY]
    GO
    

    /****** Object:  Index [ClusteredIndex-20200729-004955]    Script Date: 8/1/2020 21:57:28 ******/
    CREATE CLUSTERED INDEX [ClusteredIndex-20200729-004955] ON [dbo].[trades]
    (
        [pair] ASC,
        [ts] ASC,
        [tid] ASC
    )WITH (PAD_INDEX = OFF, STATISTICS_NORECOMPUTE = OFF, SORT_IN_TEMPDB = OFF, DROP_EXISTING = OFF, ONLINE = OFF, ALLOW_ROW_LOCKS = ON, ALLOW_PAGE_LOCKS = ON, OPTIMIZE_FOR_SEQUENTIAL_KEY = OFF) ON [PRIMARY]
    GO
    

    情况改善了很多,但是对于大数据(145,890,265 行),查询速度很慢。

    在此处输入图像描述

    • 0

相关问题

  • 从表中删除记录

  • 表扫描与非聚集索引2

  • 在另一台计算机上安装程序后建立与 SQL Server 的连接时出错

  • Azure REST API 无需浏览器即可获取令牌

  • 在 MSSQL INSTEAD 触发器中检查 UPDATE 上的值

  • 如何找出 CLR 函数的底层内容

Sidebar

Stats

  • 问题 10021
  • Answers 30001
  • 最佳答案 8000
  • 用户 6900
  • 常问
  • 回答
  • Marko Smith

    如何从列表中打印最大元素(str 类型)的长度?

    • 2 个回答
  • Marko Smith

    如何在 PyQT5 中清除 QFrame 的内容

    • 1 个回答
  • Marko Smith

    如何将具有特定字符的字符串拆分为两个不同的列表?

    • 2 个回答
  • Marko Smith

    导航栏活动元素

    • 1 个回答
  • Marko Smith

    是否可以将文本放入数组中?[关闭]

    • 1 个回答
  • Marko Smith

    如何一次用多个分隔符拆分字符串?

    • 1 个回答
  • Marko Smith

    如何通过 ClassPath 创建 InputStream?

    • 2 个回答
  • Marko Smith

    在一个查询中连接多个表

    • 1 个回答
  • Marko Smith

    对列表列表中的所有值求和

    • 3 个回答
  • Marko Smith

    如何对齐 string.Format 中的列?

    • 1 个回答
  • Martin Hope
    Alexandr_TT 2020年新年大赛! 2020-12-20 18:20:21 +0000 UTC
  • Martin Hope
    Alexandr_TT 圣诞树动画 2020-12-23 00:38:08 +0000 UTC
  • Martin Hope
    Air 究竟是什么标识了网站访问者? 2020-11-03 15:49:20 +0000 UTC
  • Martin Hope
    Qwertiy 号码显示 9223372036854775807 2020-07-11 18:16:49 +0000 UTC
  • Martin Hope
    user216109 如何为黑客设下陷阱,或充分击退攻击? 2020-05-10 02:22:52 +0000 UTC
  • Martin Hope
    Qwertiy 并变成3个无穷大 2020-11-06 07:15:57 +0000 UTC
  • Martin Hope
    koks_rs 什么是样板代码? 2020-10-27 15:43:19 +0000 UTC
  • Martin Hope
    Sirop4ik 向 git 提交发布的正确方法是什么? 2020-10-05 00:02:00 +0000 UTC
  • Martin Hope
    faoxis 为什么在这么多示例中函数都称为 foo? 2020-08-15 04:42:49 +0000 UTC
  • Martin Hope
    Pavel Mayorov 如何从事件或回调函数中返回值?或者至少等他们完成。 2020-08-11 16:49:28 +0000 UTC

热门标签

javascript python java php c# c++ html android jquery mysql

Explore

  • 主页
  • 问题
    • 热门问题
    • 最新问题
  • 标签
  • 帮助

Footer

RError.com

关于我们

  • 关于我们
  • 联系我们

Legal Stuff

  • Privacy Policy

帮助

© 2023 RError.com All Rights Reserve   沪ICP备12040472号-5