Migrating from PostgreSQL to MySQL at Cocolog


NIFTY Corporation will discuss their experiences with MySQL and how they changed the TypePad configuration to meet the Japanese market's requirements, such as higher availability, better cell phone support, and so on. Migration from PostgreSQL to MySQL, administration, optimization, and maintenance are also covered. Through this session, the audience will see how to cope when the number of servers grows quickly from six to three hundred, and learn real-life security and administration best practices.


Migrating from PostgreSQL to MySQL at Cocolog

Naoto Yokoyama, NIFTY Corporation

Garth Webb, Six Apart

Lisa Phillips, Six Apart

Credits: Kenji Hirohama, Sumisho Computer Systems Corp.

Agenda

1. What is Cocolog

2. History of Cocolog

3. DBP: Database Partitioning

4. Migration From PostgreSQL to MySQL

1. What is Cocolog

What is Cocolog

NIFTY Corporation

Established in 1986

A Fujitsu Group Company

NIFTY-Serve (licensed and interconnected with CompuServe)

One of the largest ISPs in Japan

Cocolog

First blog community at a Japanese ISP

Based on TypePad technology by Six Apart

Several hundred million PV/month

History

Dec/02/2003: Cocolog launch for ISP users

Nov/24/2005: Cocolog Free launch (free of charge)

Apr/05/2007: Cocolog for Mobile Phone launch

Apr/2008: 700 thousand users

Cocolog (screenshot of home page)

TypePad / Cocolog (screenshot comparison)

Cocolog template sets

Cocolog Growth (Users) [chart: ■Cocolog ■Cocolog Free, across phases 1-4]

Cocolog Growth (Entries) [chart: ■Cocolog ■Cocolog Free, across phases 1-4]

Technology at Cocolog

Core System

Linux 2.4/2.6

Apache 1.3/2.0/2.2 & mod_perl

Perl 5.8+CPAN

PostgreSQL 8.1

MySQL 5.0

memcached/TheSchwartz/cfengine

Eco System

LAMP, LAPP, Ruby + ActiveRecord, Capistrano

Etc...

Monitoring

Management tool: proprietary in-house development with PostgreSQL, PHP, and Perl

Monitoring points (in order of priority):

Service: response time of each post, number of spam comments/trackbacks, number of comments/trackbacks, source IP address of spam, number of entries, number of comments via mobile devices, page views via mobile devices, time of batch completion, amount of API usage, bandwidth usage

DB: disk I/O, memory and CPU usage, time of VACUUM ANALYZE

APP: number of active processes, CPU usage, memory usage

2. History of Cocolog

Phase 1: 2003/12~ (Entries: 0.04 million)

[Diagram: TypePad handles registration and writes to a single PostgreSQL database; static contents are published to NAS and served by the web servers; other services (Podcast, Portal, Profile, etc.) share the same database. Before DBP: 10 servers.]

Phase 2: 2004/12~ (Entries: 7 million)

[Diagram: the same single-PostgreSQL architecture, with new features added: rich templates and book publishing (2004/12~), telephone operator support (2005/5~); static contents still published to NAS and served by the web servers. Before DBP: 50 servers.]

Phase 2 - Problems

The system is tightly coupled.

The database server receives access from multiple points.

It is difficult to change the system design and the database schema.

Phase 3: 2006/3~ (Entries: 12 million)

[Diagram: a Web-API layer backed by memcached now sits between the surrounding services (Podcast, Portal, Profile, etc.) and TypePad; registration and the phase-2 features still write to the single PostgreSQL database; static contents are published to NAS and served by the web servers. Before DBP: 200 servers.]

Phase 4: 2007/4~ (Entries: 16 million)

[Diagram: Atom and mobile-web gateways are added in front of TypePad alongside the Web-API and memcached; PostgreSQL is still the single database. Before DBP: 300 servers.]

Now: 2008/4~

[Diagram: the same front end (Web-API, memcached, Atom, mobile web), but TypePad now runs against multiple MySQL databases. After DBP: 150 servers.]

3. TypePad Database Partitioning

Steps for Transitioning

• Server Preparation: hardware and software setup

• Global Write: write user information to the global DB

• Global Read: read/write user information on the global DB

• Move Sequence: table sequences served by the global DB

• User Data Move: move user data to user partitions

• New User Partition: all new users saved directly to user partition 1

• New User Strategy: decide on a strategy for the new user partition

• Non User Data Move: move all non-user owned data

TypePad Overview (Pre-DBP)

[Diagram: Blog readers and blog owners reach the web servers from the Internet over http(80)/https(443); mobile blog readers go through the TypeCast server, a dedicated server that talks to the ATOM server (http(80): atom api). The application servers use MEMCACHED (11211) data-caching servers to reduce DB load, the single database (Postgres, port 5432), and static content (HTML, images, etc.) on storage via nfs(2049). A mail server handles smtp(25)/pop(110), and the ADMIN(CRON) server runs periodic asynchronous tasks.]

Why Partition?

Before DBP: all inquiries (access) from every TypePad instance go to one DB (PostgreSQL), which holds both the Non-User Role and the User Role (User0) data.

After DBP (current setup): inquiries are divided among several DBs (MySQL): a Global Role, a Non-User Role, and separate User Role partitions (User1, User2, User3).

Server Preparation

New expanded setup:

Existing DB (PostgreSQL): Non-User Role and User Role (User0).

New DBs (MySQL) for partitioned data:
Global Role: maintains the user mapping and primary key generation.
Non-User Role: information that does not need to be partitioned (such as session information).
User Role partitions (User1, User2, User3): user information is partitioned.

Job Server (+ TypePad + Schwartz): asynchronous job server for executing jobs.

Schwartz DB: stores job details.

※ Grey areas are not used in the current step.

Global Write: creating the user map

①: For new registrations only, uniquely identifying user data is written to the global DB (the global role maintains the user mapping and primary key generation).

②: This same data continues to be written to the existing DB (PostgreSQL).

※ Grey areas (user partitions, job servers) are not used in this step.
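In code, the Global Write step is a small dual write in the registration path. A minimal sketch, assuming hypothetical table and column names (author, usermap, partition_id) and DBI handles for the two databases; the talk does not show the real TypePad schema:

    use strict;
    use warnings;
    use DBI;

    my $password = $ENV{DB_PASSWORD};
    my $pg = DBI->connect('dbi:Pg:dbname=typepad;host=pg-master',
                          'app', $password, { RaiseError => 1 });
    my $global = DBI->connect('dbi:mysql:database=global;host=global-db',
                              'app', $password, { RaiseError => 1 });

    sub register_user {
        my (%user) = @_;

        # (2) the full record continues to be written to the existing PostgreSQL DB
        $pg->do('INSERT INTO author (author_id, name, email) VALUES (?, ?, ?)',
                undef, @user{qw(id name email)});

        # (1) the uniquely identifying data also goes into the global DB's user map;
        # every user still lives on the original partition (user0) for now
        $global->do('INSERT INTO usermap (user_id, partition_id) VALUES (?, ?)',
                    undef, $user{id}, 0);
    }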

Global Read: use the user map to find the user partition

①: Migrate existing user data to the global DB.

②: At the start of a request, the application queries the global DB for the location of the user's data.

③: The application then talks to that DB for all queries about this user. At this stage the global DB points to the user0 partition in all cases.

※ Grey areas are not used in this step.
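At request time the partition lookup is one indexed query against the global DB. A sketch under the same assumed schema as above ($global as before; connect_partition and the handle cache are illustrative):

    # %partitions caches one DBI handle per user-role DB
    my %partitions;

    sub dbh_for_user {
        my ($user_id) = @_;

        my ($partition_id) = $global->selectrow_array(
            'SELECT partition_id FROM usermap WHERE user_id = ?',
            undef, $user_id);
        die "no mapping for user $user_id" unless defined $partition_id;

        # at this stage every mapping still points at partition 0 (user0 on Postgres)
        return $partitions{$partition_id}
            ||= connect_partition($partition_id);   # hypothetical helper
    }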

Move Sequence: migrating primary key generation

①: PostgreSQL sequences (for generating unique primary keys) are migrated to tables on the global DB that act as "pseudo-sequences" (see the sketch below).

②: The application requests new primary keys from the global DB rather than from the user partition.

※ Grey areas are not used in this step.
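MySQL has no standalone sequence objects, so a pseudo-sequence is typically a one-row table bumped with the LAST_INSERT_ID(expr) idiom, which is atomic and needs no explicit locking. A sketch (the entry_seq table name is illustrative):

    # on the global DB, one table per sequence:
    #   CREATE TABLE entry_seq (id BIGINT UNSIGNED NOT NULL) ENGINE=InnoDB;
    #   INSERT INTO entry_seq VALUES (0);

    sub next_entry_id {
        # LAST_INSERT_ID(expr) both bumps the counter and stores the new value
        # for this session, so concurrent callers each see a distinct id
        $global->do('UPDATE entry_seq SET id = LAST_INSERT_ID(id + 1)');
        my ($id) = $global->selectrow_array('SELECT LAST_INSERT_ID()');
        return $id;
    }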

User Data Move: moving user data to the new user-role partitions

①: Existing users to be migrated are submitted as new Schwartz jobs; the job servers then migrate the user data asynchronously (see the sketch below).

②: If a comment arrives while a user is being migrated, it is saved in the Schwartz DB to be published later.

③: After being migrated, all user data lives on the user-role DB partitions.

④: Once all user data is migrated, only non-user data remains on PostgreSQL.

※ Grey areas are not used in this step.
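TheSchwartz (the job queue already in the stack) drives this step: one job per user, executed asynchronously and retried on failure. A sketch of enqueueing and a worker skeleton; the worker class name and the migrate_user helper are illustrative:

    use TheSchwartz;

    my $client = TheSchwartz->new(databases => [{
        dsn  => 'dbi:mysql:database=schwartz;host=schwartz-db',
        user => 'job',
        pass => $password,
    }]);

    # submit one migration job per existing user
    $client->insert('TypePad::Worker::MoveUser', { user_id => $user_id });

    package TypePad::Worker::MoveUser;
    use base 'TheSchwartz::Worker';

    sub work {
        my ($class, $job) = @_;
        my $user_id = $job->arg->{user_id};

        # copy this user's rows from PostgreSQL to the target MySQL partition,
        # then flip the usermap entry to point at the new partition
        migrate_user($user_id);    # hypothetical helper

        $job->completed;           # or $job->failed($err) to retry later
    }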

New User Partition: new registrations are created on one user-role partition

①: When new users register, their user data is written to a user-role partition (all new users go to user partition 1).

②: Non-user data continues to be served from PostgreSQL.

※ Grey areas are not used in this step.

New User Strategy: pick a scheme for distributing new users

①: When new users register, user data is written to one of the user-role partitions, chosen by a set distribution method (round robin, random, etc.), as sketched below.

②: Non-user data continues to be served from PostgreSQL.

※ Grey areas are not used in this step.
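The distribution method only has to pick a partition id at registration time; since the user map records the choice, the scheme can change later without touching existing users. A sketch of the two schemes the slide names (partition ids are illustrative):

    my @active = (1, 2, 3);   # user-role partitions open to new users
    my $rr     = 0;

    sub pick_partition_round_robin { $active[ $rr++ % @active ] }
    sub pick_partition_random      { $active[ int rand @active ] }

    # at registration, record the choice in the user map on the global DB
    my $pid = pick_partition_round_robin();
    $global->do('INSERT INTO usermap (user_id, partition_id) VALUES (?, ?)',
                undef, $user_id, $pid);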

Non User Data Move: migrate data that cannot be partitioned by user

①: Non-user-role data left on PostgreSQL is migrated to the MySQL side: information that does not need to be partitioned, such as session information.

※ Grey areas are not used in this step.

Data migration done

①: All data access is now done through MySQL: the global role (user mapping and primary key generation), the user-role partitions (partitioned user information), and the non-user role (information that does not need to be partitioned, such as session information).

②: TheSchwartz continues to be used for asynchronous jobs: the Schwartz DB stores job details, and the job servers execute them.

The New TypePad Configuration

[Diagram: as in the pre-DBP overview, blog readers, blog owners (management interface), and mobile blog readers reach the web servers from the Internet over http(80)/https(443); the TypeCast server is a dedicated server using the ATOM server (http(80): atom api); MEMCACHED (11211) data-caching servers reduce DB load; static content (HTML, images, etc.) lives on storage via nfs(2049); the mail server handles smtp(25)/pop(110); the ADMIN(CRON) server runs periodic asynchronous tasks. The database is now MySQL (port 3306), and a new Job Server runs TheSchwartz for ad-hoc asynchronous jobs.]

4. Migration from PostgreSQL to MySQL

DB Node Spec History

History of scaling up the PostgreSQL server, before DBP (2003/12 - 2007/11):

    OS (RedHat)     CPU (Xeon)                     MEM    DiskArray
    7.4   (2.4.9)   1.8GHz/512KB x1                1GB    No
    ES2.1 (2.4.9)   3.2GHz/1MB x2                  4GB    No
    ES2.1 (2.4.9)   3.2GHz/1MB x2                  4GB    Yes
    AS2.1 (2.4.9)   3.2GHz/1MB x4                  12GB   Yes
    AS4   (2.6.9)   3.2GHz/1MB x4                  12GB   Yes
    AS4   (2.6.9)   MP 3.3GHz/1MB x4 (2 cores x4)  16GB   Yes

DB DiskArray Spec [FUJITSU ETERNUS8000]

"Best I/O transaction performance in the world"

146GB (15 krpm) x 32 disks, RAID-10

MultiPath FibreChannel 4Gbps

QuickOPC (One Point Copy): OPC copy functions let you create a duplicate copy of any data from the original at any chosen time.

http://www.computers.us.fujitsu.com/www/products_storage.shtml?products/storage/fujitsu/e8000/e8000

Scale out MySQL servers, After DBP

Role configuration: each role is configured as an HA cluster (HA software: NEC ClusterPro) with shared storage.

[Diagram: the TypePad application talks to MySQL Role 1, Role 2, Role 3, and the remaining PostgreSQL server; each role is an active/standby pair with heartbeat, attached to the DiskArray over a FibreChannel SAN.]

Scale out MySQL servers, After DBP

Backup: replication with hot backup.

[Diagram: each role's master mysqld replicates ("rep") to a dedicated mysqld instance on a shared Backup Role host; the backup instances are copied with OPC ("opc") on the DiskArray. The roles themselves remain heartbeat HA pairs on the FibreChannel SAN, serving the TypePad application.]
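A sketch of how one backup-role instance could be pointed at a role master using MySQL 5.0 replication statements; host names, credentials, and binlog coordinates are placeholders, and the talk does not show the actual commands:

    use DBI;

    # one mysqld instance per role runs on the backup host; point this one at role1
    my $backup = DBI->connect('dbi:mysql:host=backup-role;port=3307',
                              'root', $password, { RaiseError => 1 });

    $backup->do(q{
        CHANGE MASTER TO
            MASTER_HOST     = 'mysql-role1',
            MASTER_USER     = 'repl',
            MASTER_PASSWORD = 'secret',
            MASTER_LOG_FILE = 'mysql-bin.000001',
            MASTER_LOG_POS  = 4
    });
    $backup->do('START SLAVE');

    # hot backups / OPC snapshots are then taken from the backup instances,
    # never from the live masters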

Troubles with PostgreSQL 7.4 - 8.1

Data size: over 100 GB, of which 40% is index.

Severe data fragmentation.

VACUUM: "VACUUM ANALYZE" causes performance problems and takes too long on large amounts of data; dump/restore is the only solution for de-fragmentation.

Autovacuum: we don't use autovacuum, since we are worried about the response-time latency it can introduce.

Troubles with PostgreSQL 7.4 - 8.1 (continued)

Character set: PostgreSQL accepted out-of-boundary "UTF-8": Japanese extended character sets and other multi-byte sequences that should normally have been rejected with an error were stored as-is.

"Cleaning" the data: removing character sequences that fall outside the valid UTF-8 range.

Steps:

1. Dump all data from PostgreSQL
2. Split the dump for piconv
3. piconv UTF-8 -> UCS-2 -> UTF-8, then merge
4. Restore into PostgreSQL
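The UCS-2 round trip is what strips the out-of-range data: anything that does not survive UTF-8 -> UCS-2 -> UTF-8 is replaced or dropped. The talk ran piconv over the split dump files; a minimal in-process sketch of the same idea with Encode (the module behind piconv):

    use Encode qw(encode decode);

    sub clean_utf8 {
        my ($bytes) = @_;

        # invalid UTF-8 byte sequences become U+FFFD instead of dying
        my $text = decode('UTF-8', $bytes, Encode::FB_DEFAULT);

        # characters not representable in UCS-2 are substituted on the way down
        my $ucs2 = encode('UCS-2BE', $text, Encode::FB_DEFAULT);

        # back to clean, in-range UTF-8
        return encode('UTF-8', decode('UCS-2BE', $ucs2));
    }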

Migration from PostgreSQL to MySQL using a TypePad script

Steps:

1. PostgreSQL -> Perl objects & tmp publish
2. MySQL -> Perl objects & last publish
3. diff the tmp & last objects (data check)
4. diff the tmp & last published files (file check)
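The file check can be as simple as digesting both published trees and comparing. A sketch (the paths are placeholders; the talk's actual script is not shown):

    use strict;
    use warnings;
    use File::Find;
    use Digest::MD5;

    # map of relative path => content digest for every file under $root
    sub digest_tree {
        my ($root) = @_;
        my %md5;
        find(sub {
            return unless -f;
            open my $fh, '<:raw', $_ or die "$File::Find::name: $!";
            (my $rel = $File::Find::name) =~ s/^\Q$root\E//;
            $md5{$rel} = Digest::MD5->new->addfile($fh)->hexdigest;
        }, $root);
        return \%md5;
    }

    my $tmp  = digest_tree('/publish/tmp');    # published from PostgreSQL
    my $last = digest_tree('/publish/last');   # republished from MySQL

    for my $file (sort keys %$tmp) {
        print "DIFF: $file\n"
            if !exists $last->{$file} || $tmp->{$file} ne $last->{$file};
    }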

Troubles with MySQL

CONVERT_TZ() function: doesn't support input values outside the range of Unix time.

Sort order: without an "ORDER BY" clause, rows come back in a different order than they did on PostgreSQL.
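Both issues surface as subtle application bugs rather than errors, so they are worth guarding explicitly. A sketch ($dbh is a MySQL DBI handle; fallback_convert is a hypothetical helper, and named time zones assume the MySQL time zone tables are loaded):

    # CONVERT_TZ() does not convert values outside the Unix-time (TIMESTAMP)
    # range, so check the range first and convert such values in the application
    my $local;
    if (   $timestamp ge '1970-01-01 00:00:01'
        && $timestamp le '2038-01-19 03:14:07') {
        ($local) = $dbh->selectrow_array(
            q{SELECT CONVERT_TZ(?, 'UTC', 'Asia/Tokyo')}, undef, $timestamp);
    }
    $local = fallback_convert($timestamp) unless defined $local;

    # never rely on implicit row order after the migration; always ORDER BY
    my $entries = $dbh->selectall_arrayref(
        'SELECT entry_id, title FROM entry WHERE blog_id = ? ORDER BY entry_id',
        undef, $blog_id);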

Cocolog Future Plans

Dynamic pages

Job queue

HA

Maintenance

Online backup

Japanese character support

Consulting by Sumisho Computer Systems Corp.: a system integrator, the first and best partner of MySQL in Japan since 2003, providing MySQL consulting, support, and training services.

Questions