MySQL ON DUPLICATE KEY UPDATE 傻瓜应用教程

Wesley13
• 阅读 621

背景:最近有一个算数据的脚本几天都没出数据,而且手动跑起来的时候服务器立马卡机。目标是优化!

[work(2)@dm02 19:02:21 ~]$ nohup python twitter_click_stat.py 2012-11-02 1 >/tmp/click.log 2>&1 &

[3] 25862
[work(3)@dm02 19:05:07 ~/GALAXY_RELEASE/release]$ kill 25862

3分钟吃力7G内存

脚本流程:
1.load mysql表全部数据,加到MAP中( 2000W行的数据
2.查HIVE表,如果twitter_id在MAP中则(赋值或+=),不在则创建类并赋值
MYSQL 表结构:
create table t_walrus_click_stat(
    twitter_id int(11) not null,
    goods_id int(11) not null default 0,
    oneday_click SMALLINT(10) unsigned not null default 0,
    threeday_click MEDIUMINT(10) unsigned not null default 0,
    sevenday_click MEDIUMINT(10) unsigned not null default 0,
    all_click int(11) unsigned not null default 0,
    oneday_incr smallint(10) comment "增量" not null default 0,
    last_clicktime timestamp NOT NULL DEFAULT CURRENT_TIMESTAMP,
    primary key(twitter_id)
)ENGINE=Innodb DEFAULT CHARACTER SET utf8

优化点:

1.不把表都load进来:计算当天有点击的推,然后取这部分推的一天点击(计算增量)/所有

2.能不能在插入的时候用update 计算一天增量和全部点击  == 靠谱!!  技术???

MYSQL =====TEST

mysql> show create table test;| test  | CREATE TABLE `test` (
  `id` int(11) NOT NULL DEFAULT '0',
  `name` varchar(50) DEFAULT NULL,
  PRIMARY KEY (`id`)
) ENGINE=InnoDB DEFAULT CHARSET=latin1 |
1 row in set (0.08 sec)

mysql> alter table test add unique (`id`);
Query OK, 0 rows affected (0.74 sec)
Records: 0  Duplicates: 0  Warnings: 0

mysql> show create table test;
| test  | CREATE TABLE `test` (
  `id` int(11) NOT NULL DEFAULT '0',
  `name` varchar(50) DEFAULT NULL,
  PRIMARY KEY (`id`),
  UNIQUE KEY `id` (`id`)
) ENGINE=InnoDB DEFAULT CHARSET=latin1 |
1 row in set (0.00 sec)

mysql> select * from test;
+----+-----------+
| id | name      |
+----+-----------+
|  1 | wang      |
|  2 | xia       |
|  3 | wang-load |
|  4 | xia-load  |
+----+-----------+
4 rows in set (0.00 sec)

mysql> alter table test add column `click` int(11) NOT NULL DEFAULT '0';
Query OK, 4 rows affected (0.44 sec)
Records: 4  Duplicates: 0  Warnings: 0

mysql> update test set click=id;
Query OK, 4 rows affected (0.04 sec)
Rows matched: 4  Changed: 4  Warnings: 0

mysql> select * from test;
+----+-----------+-------+
| id | name      | click |
+----+-----------+-------+
|  1 | wang      |     1 |
|  2 | xia       |     2 |
|  3 | wang-load |     3 |
|  4 | xia-load  |     4 |
+----+-----------+-------+
4 rows in set (0.00 sec)

mysql> insert into test(id,name,click) values (1,'wang',101) ON DUPLICATE KEY UPDATE click=click+values(click);Query OK, 2 rows affected (0.00 sec)

mysql> select * from test;
+----+-----------+-------+
| id | name      | click |
+----+-----------+-------+
|  1 | wang      |   102 |
|  2 | xia       |     2 |
|  3 | wang-load |     3 |
|  4 | xia-load  |     4 |
+----+-----------+-------+

mysql> alter table test add column `onn_day` int(11) NOT NULL DEFAULT '100';
Query OK, 4 rows affected (0.29 sec)
Records: 4  Duplicates: 0  Warnings: 0

mysql> select * from test;
+----+-----------+-------+---------+
| id | name      | click | onn_day |
+----+-----------+-------+---------+
|  1 | wang      |   102 |     100 |
|  2 | xia       |     2 |     100 |
|  3 | wang-load |     3 |     100 |
|  4 | xia-load  |     4 |     100 |
+----+-----------+-------+---------+

mysql> insert into test(id,name,click,onn_day) values (1,'wang',100,150) ON DUPLICATE KEY UPDATE click=click+values(click);   == update 指定的字段才会更新

Query OK, 2 rows affected (0.07 sec)

mysql> select * from test;
+----+-----------+-------+---------+
| id | name      | click | onn_day |
+----+-----------+-------+---------+
|  1 | wang      |   202 |     100 |
|  2 | xia       |     2 |     100 |
|  3 | wang-load |     3 |     100 |
|  4 | xia-load  |     4 |     100 |
+----+-----------+-------+---------+
4 rows in set (0.00 sec)

mysql> insert into test(id,name,click,onn_day) values (1,'wang',50,0) ON DUPLICATE KEY UPDATE click=click+values(click),onn_day=onn_day-values(click);   == 好像是可以了
Query OK, 2 rows affected (0.04 sec)

mysql> select * from test;+----+-----------+-------+---------+
| id | name      | click | onn_day |
+----+-----------+-------+---------+
|  1 | wang      |   252 |      50 |
|  2 | xia       |     2 |     100 |
|  3 | wang-load |     3 |     100 |
|  4 | xia-load  |     4 |     100 |
+----+-----------+-------+---------+
4 rows in set (0.00 sec)

mysql>

替换成自己用的:
insert into t_walrus_click_stat(twitter_id,goods_id,oneday_click,threeday_click,sevenday_click,all_c    lick,oneday_incr) values  (),(),(),()  ON DUPLICATE KEY UPDATE oneday_click=values(oneday_click),threeday_click=values(threeday_click ),sevenday_click =values(sevenday_click ),all_click=all_click+values(oneday_click),oneday_incr=values(oneday_click)-oneday_click;

mysql> select * from t_walrus_click_stat;
twitter_id | goods_id | oneday_click | threeday_click | sevenday_click | all_click | oneday_incr | last_clicktime      |
|  493925040 |        0 |         9286 |          21348 |          43676 |     67796 |        1003 | 2012-11-07 14:35:53 |
|  499177366 |        0 |        10134 |          23417 |          46144 |     75628 |        1382 | 2012-11-07 14:35:53 |
|  518431643 |        0 |         2099 |           6530 |          16786 |     66398 |         356 | 2012-11-07 14:35:53 |
|  521260243 |        0 |         9282 |          25611 |          48083 |     80877 |       -2683 | 2012-11-07 14:35:53 |
|  522773775 |        0 |         4652 |          20168 |          42072 |     67077 |       -3707 | 2012-11-07 14:35:53 |
|  527912329 |        0 |         4532 |          14810 |          39302 |     76832 |        -286 | 2012-11-07 14:35:53 |
|  536369413 |        0 |         3602 |          13124 |          29901 |     79792 |       -3322 | 2012-11-07 14:35:53 |
|  538737288 |        0 |         4800 |          13537 |          38469 |     83509 |        1122 | 2012-11-07 14:35:53 |
|  546735988 |        0 |        11573 |          35490 |          78441 |    115983 |         709 | 2012-11-07 14:35:53 |
|  547904288 |        0 |         8029 |          18536 |          45436 |     78306 |        4145 | 2012-11-07 14:35:53 |
+------------+----------+--------------+----------------+----------------+-----------+-------------+---------------------+
10 rows in set (0.00 sec)

mysql> insert into t_walrus_click_stat(twitter_id,goods_id,oneday_click,threeday_click,sevenday_click,all_click,oneday_incr) values  (493925040,0,1000,25000,53000,0,0),(499177366,0,1000,25000,53000,0,0),(547904289,0,100,100,100,0,0) ON DUPLICATE KEY UPDATE oneday_click=values(oneday_click),threeday_click=values(threeday_click ),sevenday_click =values(sevenday_click ),all_click=all_click+values(oneday_click),oneday_incr=oneday_click-values(oneday_click);Query OK, 5 rows affected (0.08 sec)
Records: 3  Duplicates: 2  Warnings: 0

mysql> select * from t_walrus_click_stat;twitter_id | goods_id | oneday_click | threeday_click | sevenday_click | all_click | oneday_incr | last_clicktime      |+------------+----------+--------------+----------------+----------------+-----------+-------------+---------------------+
|  493925040 |        0 |         1000 |          25000 |          53000 |     68796 |           0 | 2012-11-07 14:35:53 |
|  499177366 |        0 |         1000 |          25000 |          53000 |     76628 |           0 | 2012-11-07 14:35:53 |
|  518431643 |        0 |         2099 |           6530 |          16786 |     66398 |         356 | 2012-11-07 14:35:53 |
|  521260243 |        0 |         9282 |          25611 |          48083 |     80877 |       -2683 | 2012-11-07 14:35:53 |
|  522773775 |        0 |         4652 |          20168 |          42072 |     67077 |       -3707 | 2012-11-07 14:35:53 |
|  527912329 |        0 |         4532 |          14810 |          39302 |     76832 |        -286 | 2012-11-07 14:35:53 |
|  536369413 |        0 |         3602 |          13124 |          29901 |     79792 |       -3322 | 2012-11-07 14:35:53 |
|  538737288 |        0 |         4800 |          13537 |          38469 |     83509 |        1122 | 2012-11-07 14:35:53 |
|  546735988 |        0 |        11573 |          35490 |          78441 |    115983 |         709 | 2012-11-07 14:35:53 |
|  547904288 |        0 |         8029 |          18536 |          45436 |     78306 |        4145 | 2012-11-07 14:35:53 |
|  547904289 |        0 |          100 |            100 |            100 |         0 |           0 | 2012-11-07 14:39:10 |
+------------+----------+--------------+----------------+----------------+-----------+-------------+---------------------+
11 rows in set (0.00 sec)

2个问题:
1.其他的字段都OK,只有oneday_incr 为0。怀疑是  UPDATE oneday_click=values(oneday_click),threeday_click=values(threeday_click ),sevenday_click =values(sevenday_click ),all_click=all_click+values(oneday_click),oneday_incr=oneday_click-values(oneday_click);  先oneday_click=values(oneday_click)赋值了,所有再相减必然是0

2. 对应以前表里没有的数据所有字段都要在SQL中补全。all_click | oneday_incr  是 sevenday_click,oneday_click,

insert into t_walrus_click_stat(twitter_id,goods_id,oneday_click,threeday_click,sevenday_click,all_click,oneday_incr) values  (493925040,0,10000,25000,53000,0,0),(499177366,0,10000,25000,53000,0,0),(547904289,0,100,100,100,0,0) ON DUPLICATE KEY UPDATE oneday_incr=cast(cast(values(oneday_click) AS SIGNED)-cast(oneday_click AS SIGNED) AS SIGNED),oneday_click=values(oneday_click),threeday_click=values(threeday_click ),sevenday_click =values(sevenday_click ),all_click=all_click+values(oneday_click);
Query OK, 5 rows affected (0.00 sec)
Records: 3  Duplicates: 2  Warnings: 0

mysql>
mysql> select * from t_walrus_click_stat;+------------+----------+--------------+----------------+----------------+-----------+-------------+---------------------+| twitter_id | goods_id | oneday_click | threeday_click | sevenday_click | all_click | oneday_incr | last_clicktime      |+------------+----------+--
|  493925040 |        0 |        10000 |          25000 |          53000 |     77796 |         714 | 2012-11-07 14:41:46 |
|  499177366 |        0 |        10000 |          25000 |          53000 |     85628 |        -134 | 2012-11-07 14:41:46 |
|  518431643 |        0 |         2099 |           6530 |          16786 |     66398 |         356 | 2012-11-07 14:41:46 |
|  521260243 |        0 |         9282 |          25611 |          48083 |     80877 |       -2683 | 2012-11-07 14:41:46 |
|  522773775 |        0 |         4652 |          20168 |          42072 |     67077 |       -3707 | 2012-11-07 14:41:46 |
|  527912329 |        0 |         4532 |          14810 |          39302 |     76832 |        -286 | 2012-11-07 14:41:46 |
|  536369413 |        0 |         3602 |          13124 |          29901 |     79792 |       -3322 | 2012-11-07 14:41:46 |
|  538737288 |        0 |         4800 |          13537 |          38469 |     83509 |        1122 | 2012-11-07 14:41:46 |
|  546735988 |        0 |        11573 |          35490 |          78441 |    115983 |         709 | 2012-11-07 14:41:46 |
|  547904288 |        0 |         8029 |          18536 |          45436 |     78306 |        4145 | 2012-11-07 14:41:46 |
|  547904289 |        0 |          100 |            100 |            100 |         0 |           0 | 2012-11-07 16:50:49 |
+------------+----------+--------------+----------------+----------------+-----------+-------------+---------------------+
11 rows in set (0.03 sec)

OK。搞定!!

点赞
收藏
评论区
推荐文章
blmius blmius
2年前
MySQL:[Err] 1292 - Incorrect datetime value: ‘0000-00-00 00:00:00‘ for column ‘CREATE_TIME‘ at row 1
文章目录问题用navicat导入数据时,报错:原因这是因为当前的MySQL不支持datetime为0的情况。解决修改sql\mode:sql\mode:SQLMode定义了MySQL应支持的SQL语法、数据校验等,这样可以更容易地在不同的环境中使用MySQL。全局s
Jacquelyn38 Jacquelyn38
2年前
2020年前端实用代码段,为你的工作保驾护航
有空的时候,自己总结了几个代码段,在开发中也经常使用,谢谢。1、使用解构获取json数据let jsonData  id: 1,status: "OK",data: 'a', 'b';let  id, status, data: number   jsonData;console.log(id, status, number )
皕杰报表之UUID
​在我们用皕杰报表工具设计填报报表时,如何在新增行里自动增加id呢?能新增整数排序id吗?目前可以在新增行里自动增加id,但只能用uuid函数增加UUID编码,不能新增整数排序id。uuid函数说明:获取一个UUID,可以在填报表中用来创建数据ID语法:uuid()或uuid(sep)参数说明:sep布尔值,生成的uuid中是否包含分隔符'',缺省为
Stella981 Stella981
2年前
Python3:sqlalchemy对mysql数据库操作,非sql语句
Python3:sqlalchemy对mysql数据库操作,非sql语句python3authorlizmdatetime2018020110:00:00coding:utf8'''
Wesley13 Wesley13
2年前
mysql设置时区
mysql设置时区mysql\_query("SETtime\_zone'8:00'")ordie('时区设置失败,请联系管理员!');中国在东8区所以加8方法二:selectcount(user\_id)asdevice,CONVERT\_TZ(FROM\_UNIXTIME(reg\_time),'08:00','0
Wesley13 Wesley13
2年前
00:Java简单了解
浅谈Java之概述Java是SUN(StanfordUniversityNetwork),斯坦福大学网络公司)1995年推出的一门高级编程语言。Java是一种面向Internet的编程语言。随着Java技术在web方面的不断成熟,已经成为Web应用程序的首选开发语言。Java是简单易学,完全面向对象,安全可靠,与平台无关的编程语言。
Stella981 Stella981
2年前
Android蓝牙连接汽车OBD设备
//设备连接public class BluetoothConnect implements Runnable {    private static final UUID CONNECT_UUID  UUID.fromString("0000110100001000800000805F9B34FB");
Wesley13 Wesley13
2年前
MySQL部分从库上面因为大量的临时表tmp_table造成慢查询
背景描述Time:20190124T00:08:14.70572408:00User@Host:@Id:Schema:sentrymetaLast_errno:0Killed:0Query_time:0.315758Lock_
为什么mysql不推荐使用雪花ID作为主键
作者:毛辰飞背景在mysql中设计表的时候,mysql官方推荐不要使用uuid或者不连续不重复的雪花id(long形且唯一),而是推荐连续自增的主键id,官方的推荐是auto_increment,那么为什么不建议采用uuid,使用uuid究
Python进阶者 Python进阶者
3个月前
Excel中这日期老是出来00:00:00,怎么用Pandas把这个去除
大家好,我是皮皮。一、前言前几天在Python白银交流群【上海新年人】问了一个Pandas数据筛选的问题。问题如下:这日期老是出来00:00:00,怎么把这个去除。二、实现过程后来【论草莓如何成为冻干莓】给了一个思路和代码如下:pd.toexcel之前把这