⚓ T299947 Normalize pagelinks table
Closed, Resolved
Public
Assigned To
Ladsgroup
Authored By
Ladsgroup
Jan 24 2022, 6:40 PM
2022-01-24 18:40:11 (UTC+0)
Tags
MediaWiki-Page-derived-data
(Backlog)
MW-1.43-notes (1.43.0-wmf.11; 2024-06-25)
Data-Persistence
(Epic - Database)
User-notice-archive
(Backlog)
Patch-For-Review
MW-1.45-notes (1.45.0-wmf.6; 2025-06-17)
Referenced Files
None
Subscribers
Aklapper
BrandonXLF
BTullis
Certes
Cryptic
Huji
hut8
View All 23 Subscribers
Description
Once T299417: Normalize templatelinks table is done.
Details
Related Changes in Gerrit:
Subject | Repo | Branch | Lines +/-
Stop setting $wgPageLinksSchemaMigrationStage | operations/mediawiki-config | master | +0 -5
Remove pagelinks migration config | mediawiki/core | master | +14 -80
schema: Drop old pagelinks columns | mediawiki/core | master | +225 -93
Stop writing to old columns of pagelinks in most wikis | operations/mediawiki-config | master | +27 -5
Run pagelinks migration for third parties too | mediawiki/core | master | +45 -6
beta: Stop writing to the old columns of pagelinks in fawiki | operations/mediawiki-config | master | +1 -0
wikireplicas: Allow pagelinks.pl_target_id to be replicated to the cloud | operations/puppet | production | +2 -1
Related Objects
Task Graph
Mentions
Status | Subtype | Assigned | Task
Resolved | | Ladsgroup | T300222 Implement normalizing MediaWiki link tables
Resolved | | simon04 | T365497 bldrwnsch update is broken – Unknown column 'pagelinks.pl_title'
Resolved | | Ladsgroup | T299947 Normalize pagelinks table
Resolved | | Ladsgroup | T342689 Add pl_target_id column to pagelinks
Resolved | | ABran-WMF | T343198 Add pl_target_id column to pagelinks in production
Resolved | | Ladsgroup | T345669 Add support for write both for pagelinks
Resolved | | Ladsgroup | T345732 Turn on write both for beta and production
Resolved | | Ladsgroup | T345733 Run MigrateLinksTable for pagelinks in beta and production
Resolved | | JAllemandou | T345771 Adapt Sqoop to pagelinks schema change
Resolved | | Ladsgroup | T346457 Add support for read new for pagelinks
Resolved | | SBisson | T346784 Add support for read new for pagelinks in Wikistories
Resolved | | Ladsgroup | T351237 Set beta and production to read new for pagelinks migration
Resolved | | Ladsgroup | T352010 Gradually drop old pagelinks columns
Resolved | | Marostegui | T364069 Rebuild pagelinks tables
Resolved | | Marostegui | T367140 Switchover s3 master (db1223 -> db1157)
Resolved | | Marostegui | T366687 Switchover s2 master (db1222 -> db1162)
Resolved | | Marostegui | T367146 Switchover s5 master (db1230 -> db1183)
Resolved | | Marostegui | T367145 Switchover s5 master (db2123 -> db2213)
Resolved | | Marostegui | T368355 Switchover s8 master (db2165 -> db2161)
Resolved | | Marostegui | T368371 Switchover s8 master (db1192 -> db1209)
Declined | | Marostegui | T364067 Switchover s6 master (db1173 -> db1231)
Resolved | | Marostegui | T363977 Switchover s8 master (db2165 -> db2161)
Resolved | | Umherirrender | T364546 namespaceDupes is not respecting links migration stage (again)
Resolved | | Marostegui | T364541 Switchover s8 master (db1209 -> db1192)
Resolved | | Marostegui | T365787 db1155:3316 pagelinks table corrupted
Resolved | | Marostegui | T369020 Switchover s6 master (db1231 -> db1173)
Resolved | BUG REPORT | Zabe | T419479 Upgrading pagelinks table makes pl_target_id NOT NULL, creating from scratch doesn’t
Mentioned In
T419479: Upgrading pagelinks table makes pl_target_id NOT NULL, creating from scratch doesn’t
T183490: MCR schema migration stage 4: Migrate External Store URLs (wmf production)
rXTRMRB573042a7357a: PageRepository: update query following pagelinks normalization
T378762: Edits seem to be much slower in REL1_43
T367940: Beta cluster fails with "LinksMigration config …WRITE_OLD on pagelinks … not supported anymore"
T366996: Large number wiki dumps with reduced/changed db schema for pagelinks.sql.gz
T352010: Gradually drop old pagelinks columns
rXT573042a7357a: PageRepository: update query following pagelinks normalization
T350431: Run maintenance scripts on Serbian projects
T350007: [M] Adapt image suggestions to comply with breaking database schema changes
T300222: Implement normalizing MediaWiki link tables
T297633:
T299953: Normalize imagelinks table
T299951: Normalize categorylinks table
Mentioned Here
T183490: MCR schema migration stage 4: Migrate External Store URLs (wmf production)
T363876: Research Tech News potential improvements
T222224: RFC: Normalize MediaWiki link tables
T352010: Gradually drop old pagelinks columns
T350007: [M] Adapt image suggestions to comply with breaking database schema changes
T14019: ifexist function uses pagelinks table for lack of better options
T312666: Remove duplication in externallinks table
T299417: Normalize templatelinks table
Event Timeline
There are a very large number of changes, so older changes are hidden.
BTullis
subscribed.
Nov 15 2023, 3:23 PM
2023-11-15 15:23:47 (UTC+0)
We deployed
today with a maintain-views run on all wikireplica servers.
gerritbot
added a comment.
Nov 24 2023, 3:25 PM
2023-11-24 15:25:27 (UTC+0)
Change 977218 had a related patch set uploaded (by Ladsgroup; author: Amir Sarabadani):
[operations/mediawiki-config@master] beta: Stop writing to the old columns of pagelinks in fawiki
gerritbot
added a project:
Patch-For-Review
Nov 24 2023, 3:25 PM
2023-11-24 15:25:28 (UTC+0)
Change 977218
merged
by jenkins-bot:
[operations/mediawiki-config@master] beta: Stop writing to the old columns of pagelinks in fawiki
Maintenance_bot
removed a project:
Patch-For-Review
Nov 24 2023, 3:30 PM
2023-11-24 15:30:38 (UTC+0)
Ladsgroup
added a comment.
Nov 24 2023, 3:51 PM
2023-11-24 15:51:09 (UTC+0)
I changed PK of pagelinks in fawiki in beta cluster:
ALTER TABLE pagelinks DROP PRIMARY KEY, ADD PRIMARY KEY (`pl_from`, `pl_target_id`);
Then, stopped writing the old columns and then dropped them and indexes:
ALTER TABLE pagelinks DROP COLUMN pl_namespace, DROP COLUMN pl_title;
DROP INDEX pl_namespace ON pagelinks;
DROP INDEX pl_backlinks_namespace ON pagelinks;
The size of the table went from 9.1M to 448K, but the catch is that the majority of the size reduction came from running optimize when changing the PK, which took it to 996K. So the net gain here is cutting the size of the table to around 45%, which is pretty good regardless.
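As a rough check of the arithmetic in the comment above, here is a minimal sketch in Python. It assumes that "9.1M" means 9.1 MiB and that "996K" and "448K" are KiB; the variable names are illustrative only. A later comment in this thread says these measurements were being redone, so the numbers only reproduce the reasoning, not a final result.

```python
# Sizes quoted in the comment above. Assumption: "M" is MiB and "K" is KiB.
original_kib = 9.1 * 1024       # table size before any change
after_pk_change_kib = 996       # after the PK change, which also optimized the table
after_drop_kib = 448            # after dropping the old columns and indexes

# Share of the size that is left after each step.
optimize_share = after_pk_change_kib / original_kib
column_drop_share = after_drop_kib / after_pk_change_kib
overall_share = after_drop_kib / original_kib

print(f"after optimize:    {optimize_share:.1%} of the original size")
print(f"after column drop: {column_drop_share:.1%} of the optimized size")
print(f"overall:           {overall_share:.1%} of the original size")
```

The "around 45%" figure corresponds to `column_drop_share`, that is, the effect of dropping the columns measured against the already optimized table rather than against the original 9.1M.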
Ladsgroup
added a comment.
Nov 27 2023, 9:51 AM
2023-11-27 09:51:46 (UTC+0)
Actually, my measurements were incorrect. The impact is bigger; let me redo that.
Ladsgroup
added a comment.
Nov 27 2023, 10:42 AM
2023-11-27 10:42:22 (UTC+0)
Running it on simplewiki gave 38% of the original size. On top, you'd get a clean up due to running optimize table which can in itself cut the size of the table from half to one fifth depending on the wiki.
Ladsgroup
reopened subtask
T343198: Add pl_target_id column to pagelinks in production
as
Open
Dec 7 2023, 6:02 PM
2023-12-07 18:02:35 (UTC+0)
Ladsgroup
closed subtask
T343198: Add pl_target_id column to pagelinks in production
as
Resolved
Dec 11 2023, 11:58 AM
2023-12-11 11:58:09 (UTC+0)
hut8
subscribed.
Dec 11 2023, 10:10 PM
2023-12-11 22:10:25 (UTC+0)
Ladsgroup
closed subtask
T346457: Add support for read new for pagelinks
as
Resolved
Dec 21 2023, 1:18 PM
2023-12-21 13:18:12 (UTC+0)
Umherirrender
mentioned this in
T350431: Run maintenance scripts on Serbian projects
Dec 21 2023, 5:50 PM
2023-12-21 17:50:53 (UTC+0)
Huji
subscribed.
Edited
Jan 20 2024, 8:22 PM
2024-01-20 20:22:02 (UTC+0)
I fully understand the reasoning behind this normalization. But (unexpectedly) it will cause some queries to run substantially slower. For instance:
old query (3 seconds)
SELECT
page_title,
pl_title,
CASE
WHEN c2.cl_to IS NULL THEN NULL
ELSE '{{yes}}'
END AS under_construction
FROM page JOIN pagelinks
ON page_id = pl_from
LEFT JOIN categorylinks c2
ON page_id = c2.cl_from
AND c2.cl_to = 'صفحههای_گسترده_در_دست_ساخت'
WHERE
page_namespace = 0
AND pl_namespace IN (2, 3)
AND NOT EXISTS (
SELECT c1.cl_to
FROM categorylinks c1
WHERE
page_id = c1.cl_from
AND c1.cl_to = 'مقالههای_نامزد_حذف_سریع'
)
versus:
new query (3.5 minutes)
SELECT
page_title,
lt_title,
CASE
WHEN c2.cl_to IS NULL THEN NULL
ELSE '{{yes}}'
END AS under_construction
FROM page
JOIN pagelinks
ON page_id = pl_from
JOIN linktarget
ON pl_target_id = lt_id
LEFT JOIN categorylinks c2
ON page_id = c2.cl_from
AND c2.cl_to = 'صفحههای_گسترده_در_دست_ساخت'
WHERE
page_namespace = 0
AND lt_namespace IN (2, 3)
AND NOT EXISTS (
SELECT c1.cl_to
FROM categorylinks c1
WHERE
page_id = c1.cl_from
AND c1.cl_to = 'مقالههای_نامزد_حذف_سریع'
)
It would be nice if we had analytics on the queries run against the pagelinks table (both in production and in replicas) and what the average or median query duration looks like over time.
Huji
added a comment.
Jan 20 2024, 8:30 PM
2024-01-20 20:30:47 (UTC+0)
In looking at the EXPLAIN results for those two queries, the first one runs more quickly not only because of denormalization, but also because the WHERE condition on pl_namespace uses the pl_namespace index on the pagelinks table. However, it seems like the second one does *not* use the lt_namespace_title index on the linktarget table (where I expected it would be used as a partial index). Any ideas on how to improve my query or the indexes?
Ladsgroup
added a comment.
Jan 22 2024, 2:55 PM
2024-01-22 14:55:16 (UTC+0)
Maybe this is what you're looking for:
SELECT
  page_title,
  lt_title,
  CASE
    WHEN c2.cl_to IS NULL THEN NULL
    ELSE '{{yes}}'
  END AS under_construction
FROM page
JOIN pagelinks
  ON page_id = pl_from
JOIN linktarget
  ON pl_target_id = lt_id
LEFT JOIN categorylinks c2
  ON page_id = c2.cl_from
WHERE
  c2.cl_to = 'صفحههای_گسترده_در_دست_ساخت'
  AND page_namespace = 0
  AND lt_namespace IN (2, 3)
  AND page_id NOT IN (
    SELECT c1.cl_from
    FROM categorylinks c1
    WHERE c1.cl_to = 'مقالههای_نامزد_حذف_سریع'
  )
(If I understood your query correctly, you want articles that are in "daste sakht" (under construction) but not in "naamzade hazfe sari" (speedy deletion candidates)?)
Huji
added a comment.
Jan 23 2024, 1:10 AM
2024-01-23 01:10:32 (UTC+0)
No. The query finds all articles linking to a User or User_talk page, and distinguishes which has an "under construction" category. So the last part is optional, not mandatory. Your suggested query makes it mandatory.
But thanks for taking a look.
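The difference described above, an optional versus a mandatory category match, can be reproduced on a toy dataset. The sketch below uses Python's sqlite3 module rather than MariaDB, and the page titles and category names are made up for illustration. It only shows the LEFT JOIN semantics, not production behaviour or performance.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE page (page_id INTEGER PRIMARY KEY, page_title TEXT);
CREATE TABLE categorylinks (cl_from INTEGER, cl_to TEXT);
-- Hypothetical data: one page in the category of interest, one page outside it.
INSERT INTO page VALUES (1, 'Tagged'), (2, 'Untagged');
INSERT INTO categorylinks VALUES (1, 'Under_construction'), (2, 'Other_category');
""")

# Category filter inside ON: pages without the category are kept (flag is NULL).
FILTER_IN_ON = """
SELECT page_title, CASE WHEN c2.cl_to IS NULL THEN NULL ELSE '{{yes}}' END
FROM page
LEFT JOIN categorylinks c2
  ON page_id = c2.cl_from AND c2.cl_to = 'Under_construction'
ORDER BY page_id
"""

# Category filter moved to WHERE: rows without the category are filtered out,
# so the LEFT JOIN effectively behaves like an inner join.
FILTER_IN_WHERE = """
SELECT page_title, CASE WHEN c2.cl_to IS NULL THEN NULL ELSE '{{yes}}' END
FROM page
LEFT JOIN categorylinks c2
  ON page_id = c2.cl_from
WHERE c2.cl_to = 'Under_construction'
ORDER BY page_id
"""

optional_rows = conn.execute(FILTER_IN_ON).fetchall()
mandatory_rows = conn.execute(FILTER_IN_WHERE).fetchall()
print(optional_rows)
print(mandatory_rows)
```

With the filter in the ON clause both pages are returned and only the tagged one is flagged. With the filter in the WHERE clause the untagged page disappears from the result.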
Ladsgroup
added a comment.
Jan 24 2024, 12:24 AM
2024-01-24 00:24:09 (UTC+0)
Ah, I see. Unfortunately, I don't think there is a way to make the query instantly fast, as the knowledge that has the highest cardinality (pl from ns + pl target ns) is now literally sitting in two different tables, and none of the fields has enough cardinality on its own to make the query faster in its respective table. I think it'll get a bit worse once the data is properly populated in linktarget (and more links to the user and user talk namespaces start being added to linktarget), but OTOH, dropping the old columns makes all queries faster by increasing in-memory lookups, and this took 48s for me:
SELECT
  page_title,
  lt_title,
  CASE
    WHEN page_id IN (
      SELECT cl_from
      FROM categorylinks
      WHERE cl_to = 'صفحههای_گسترده_در_دست_ساخت'
    ) THEN '{{yes}}'
    ELSE NULL
  END AS under_construction
FROM page
JOIN pagelinks
  ON page_id = pl_from
JOIN linktarget
  ON pl_target_id = lt_id
WHERE
  page_namespace = 0
  AND pl_from_namespace = 0
  AND lt_namespace IN (2, 3)
  AND page_id NOT IN (
    SELECT c1.cl_from
    FROM categorylinks c1
    WHERE c1.cl_to = 'مقالههای_نامزد_حذف_سریع'
  )
(adding pl_from_namespace = 0 helps a bit)
Ladsgroup
added a comment.
Feb 13 2024, 11:52 AM
2024-02-13 11:52:00 (UTC+0)
@Huji
Hi, the data has been fully populated for fawiki. If you want to check and switch your tools and queries, it should be fine to do so now.
lbowmaker
closed subtask
T345771: Adapt Sqoop to pagelinks schema change
as
Resolved
Mar 6 2024, 9:39 PM
2024-03-06 21:39:06 (UTC+0)
MusikAnimal
mentioned this in
rXT573042a7357a: PageRepository: update query following pagelinks normalization
Apr 17 2024, 4:47 PM
2024-04-17 16:47:57 (UTC+0)
gerritbot
added a comment.
Apr 25 2024, 5:00 PM
2024-04-25 17:00:03 (UTC+0)
Change #1024450 had a related patch set uploaded (by Ladsgroup; author: Amir Sarabadani):
[mediawiki/core@master] Run pagelinks migration for third parties too
gerritbot
added a project:
Patch-For-Review
Apr 25 2024, 5:00 PM
2024-04-25 17:00:04 (UTC+0)
gerritbot
added a comment.
Apr 27 2024, 9:56 PM
2024-04-27 21:56:42 (UTC+0)
Change #1024450
merged
by jenkins-bot:
[mediawiki/core@master] Run pagelinks migration for third parties too
ReleaseTaggerBot
added a project:
MW-1.43-notes (1.43.0-wmf.3; 2024-04-30)
Apr 27 2024, 10:00 PM
2024-04-27 22:00:31 (UTC+0)
Maintenance_bot
removed a project:
Patch-For-Review
Apr 27 2024, 10:30 PM
2024-04-27 22:30:45 (UTC+0)
Ladsgroup
edited projects, added
User-notice
; removed
Platform Engineering
Apr 30 2024, 10:06 AM
2024-04-30 10:06:12 (UTC+0)
We are about to start dropping the old columns everywhere.
Trizek-WMF
subscribed.
May 2 2024, 2:03 PM
2024-05-02 14:03:30 (UTC+0)
What is this change, and what should be announced in Tech News (if anything)?
Wbm1058
subscribed.
May 4 2024, 5:44 PM
2024-05-04 17:44:25 (UTC+0)
In
T299947#9756973
@Ladsgroup
wrote:
We are about to start dropping the old columns everywhere.
Does "everywhere" include Manual:Pagelinks table, which has yet to be updated this year and doesn't indicate that any columns are deprecated? Nor document the new way?
Huji
added a comment.
May 7 2024, 3:03 AM
2024-05-07 03:03:15 (UTC+0)
In
T299947#9537244
@Ladsgroup
wrote:
@Huji
Hi, the data has been fully populated for fawiki. If you want to check and switch your tools and queries, it should be fine to do so now.
I migrated all my code. In most cases, queries are running just a tad slower. In a few cases, they seem to run a tad faster. In all cases, my tests showed the results were identical.
gerritbot
added a comment.
May 7 2024, 9:43 AM
2024-05-07 09:43:38 (UTC+0)
Change #1028778 had a related patch set uploaded (by Ladsgroup; author: Amir Sarabadani):
[operations/mediawiki-config@master] Stop writing to old columns of pagelinks in most wikis
gerritbot
added a project:
Patch-For-Review
May 7 2024, 9:43 AM
2024-05-07 09:43:39 (UTC+0)
gerritbot
added a comment.
May 7 2024, 3:31 PM
2024-05-07 15:31:47 (UTC+0)
Change #1028778
merged
by jenkins-bot:
[operations/mediawiki-config@master] Stop writing to old columns of pagelinks in most wikis
Stashbot
added a comment.
May 7 2024, 3:32 PM
2024-05-07 15:32:32 (UTC+0)
Mentioned in SAL (#wikimedia-operations)
[2024-05-07T15:32:32Z]
T352010
T299947
)]]
Stashbot
mentioned this in
T352010: Gradually drop old pagelinks columns
May 7 2024, 3:32 PM
2024-05-07 15:32:35 (UTC+0)
Stashbot
added a comment.
May 7 2024, 3:38 PM
2024-05-07 15:38:12 (UTC+0)
Mentioned in SAL (#wikimedia-operations)
[2024-05-07T15:38:11Z]
T352010
T299947
)]] synced to the testservers (
Stashbot
added a comment.
May 7 2024, 4:05 PM
2024-05-07 16:05:07 (UTC+0)
Mentioned in SAL (#wikimedia-operations)
[2024-05-07T16:05:02Z]
T352010
T299947
)]] (duration: 32m 29s)
Ladsgroup
added a comment.
May 7 2024, 4:09 PM
2024-05-07 16:09:05 (UTC+0)
In
T299947#9764013
@Trizek-WMF
wrote:
What is this change, and what should be announced in Tech News (if anything)?
Something along the lines of:
Two columns of the pagelinks table (pl_namespace, pl_title) are being dropped soon. Users must use pl_target_id instead (a foreign key to the linktarget table). See T222224 for the reasoning.
Maintenance_bot
removed a project:
Patch-For-Review
May 7 2024, 4:31 PM
2024-05-07 16:31:16 (UTC+0)
Quiddity
subscribed.
May 10 2024, 1:19 AM
2024-05-10 01:19:21 (UTC+0)
@Ladsgroup
Added to
-- Thanks for the draft wording, always appreciated!
I think I've gotten the highlighting and non-translation snippets correct, but please glance-check.
It will be frozen for translations in ~20 hours, if any tweaks (or link additions/changes) are needed before then.
Quiddity
moved this task from
To Triage
to
In current Tech/News draft
on the
User-notice
board.
May 10 2024, 1:19 AM
2024-05-10 01:19:31 (UTC+0)
Wbm1058
added a comment.
May 10 2024, 3:31 AM
2024-05-10 03:31:29 (UTC+0)
It doesn't work
Can't use pagelinks without linktarget; can't use linktarget without pagelinks
Chicken and egg problem
Ladsgroup
added a comment.
May 10 2024, 9:22 AM
2024-05-10 09:22:02 (UTC+0)
In
T299947#9785030
@Wbm1058
wrote:
It doesn't work
Can't use pagelinks without linktarget; can't use linktarget without pagelinks
Chicken and egg problem
The join columns were wrong: Here is the correct query:
Wbm1058
added a comment.
May 10 2024, 12:00 PM
2024-05-10 12:00:03 (UTC+0)
Thanks. So Quiddity, the advice: "Users must use pl_target_id instead (a foreign key to the linktarget table)." wasn't sufficient to clearly explain to me what I needed to do. I think, to better explain for rusty SQL users like me:
Two columns of the pagelinks table (pl_namespace, pl_title) are being dropped soon. Users must use two columns of the new linktarget table instead (lt_namespace, lt_title).
In your existing SQL queries:
Replace JOIN pagelinks with JOIN linktarget and pl_ with lt_ in the ON statement
Below that add JOIN pagelinks ON lt_id = pl_target_id
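The rewrite steps above can be sketched on a toy dataset. The example below uses Python's sqlite3 module rather than MariaDB, and the rows are invented for illustration. The toy pagelinks table carries both the old columns and pl_target_id (as during the "write both" stage), so the old and the rewritten query can be compared side by side. It is a sketch under those assumptions, not the actual production schema.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE page (page_id INTEGER PRIMARY KEY, page_namespace INTEGER, page_title TEXT);
CREATE TABLE linktarget (lt_id INTEGER PRIMARY KEY, lt_namespace INTEGER, lt_title TEXT);
-- Toy "write both" stage: old and new columns side by side.
CREATE TABLE pagelinks (pl_from INTEGER, pl_namespace INTEGER, pl_title TEXT, pl_target_id INTEGER);
INSERT INTO page VALUES (1, 0, 'Alpha'), (2, 0, 'Beta'), (3, 2, 'Example_user');
INSERT INTO linktarget VALUES (10, 0, 'Beta'), (11, 2, 'Example_user'), (12, 0, 'Missing_page');
INSERT INTO pagelinks VALUES
  (1, 0, 'Beta', 10),
  (1, 2, 'Example_user', 11),
  (2, 2, 'Example_user', 11),
  (2, 0, 'Missing_page', 12);
""")

# Old style: join page to pagelinks on the link target's namespace and title.
OLD_QUERY = """
SELECT pl_from, page_title
FROM page
JOIN pagelinks ON pl_namespace = page_namespace AND pl_title = page_title
ORDER BY pl_from, page_title
"""

# New style: JOIN pagelinks became JOIN linktarget (pl_ -> lt_ in the ON clause),
# followed by JOIN pagelinks ON lt_id = pl_target_id.
NEW_QUERY = """
SELECT pl_from, page_title
FROM page
JOIN linktarget ON lt_namespace = page_namespace AND lt_title = page_title
JOIN pagelinks ON lt_id = pl_target_id
ORDER BY pl_from, page_title
"""

old_rows = conn.execute(OLD_QUERY).fetchall()
new_rows = conn.execute(NEW_QUERY).fetchall()
print(old_rows == new_rows)
print(new_rows)
```

Both queries list the links whose target page exists; the link to the hypothetical 'Missing_page' drops out in either form because no page row matches it.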
Quiddity
added a comment.
May 10 2024, 9:32 PM
2024-05-10 21:32:39 (UTC+0)
Thanks for the detailed recommendation! Updated at
Sidenote (Tech News insider stuff): I'm hesitant about including that much technical detail (as it's a burden for the translators, and confusing + bulky for those completely unfamiliar with the topic...), but we're trying to improve Tech News (cf.
T363876
) so we need to experiment with some aspects in order to get clear example-based feedback, which I think this will help with. :)
Quiddity
moved this task from
In current Tech/News draft
to
Already announced/Archive
on the
User-notice
board.
May 15 2024, 6:39 PM
2024-05-15 18:39:10 (UTC+0)
simon04
added a subtask:
T365497: bldrwnsch update is broken – Unknown column 'pagelinks.pl_title'
May 21 2024, 3:52 PM
2024-05-21 15:52:54 (UTC+0)
simon04
closed subtask
T365497: bldrwnsch update is broken – Unknown column 'pagelinks.pl_title'
as
Resolved
May 21 2024, 3:59 PM
2024-05-21 15:59:53 (UTC+0)
JJMC89
removed a subtask:
T365497: bldrwnsch update is broken – Unknown column 'pagelinks.pl_title'
May 22 2024, 8:54 PM
2024-05-22 20:54:18 (UTC+0)
JJMC89
added a parent task:
T365497: bldrwnsch update is broken – Unknown column 'pagelinks.pl_title'
Ladsgroup
closed subtask
T345733: Run MigrateLinksTable for pagelinks in beta and production
as
Resolved
May 28 2024, 5:03 PM
2024-05-28 17:03:29 (UTC+0)
Ladsgroup
closed subtask
T351237: Set beta and production to read new for pagelinks migration
as
Resolved
May 28 2024, 5:37 PM
2024-05-28 17:37:30 (UTC+0)
Pppery
mentioned this in
T366996: Large number wiki dumps with reduced/changed db schema for pagelinks.sql.gz
Jun 8 2024, 8:46 PM
2024-06-08 20:46:12 (UTC+0)
gerritbot
added a comment.
Jun 14 2024, 12:20 PM
2024-06-14 12:20:45 (UTC+0)
Change #1043756 had a related patch set uploaded (by Ladsgroup; author: Amir Sarabadani):
[mediawiki/core@master] schema: Drop old pagelinks columns
gerritbot
added a project:
Patch-For-Review
Jun 14 2024, 12:20 PM
2024-06-14 12:20:46 (UTC+0)
Physikerwelt
subscribed.
Jun 18 2024, 6:23 AM
2024-06-18 06:23:40 (UTC+0)
I am running MW 1_43-wmf9 and started running update.php yesterday night. It seems that the updater is very slow (running migrateLinksTable on pagelinks). I now started to run this migration in a separate process. Do you have any idea how long the migration might take? My first estimation is 12 days for our 12M pages. This seems very slow, does that make sense?
Ladsgroup
added a comment.
Jun 18 2024, 9:51 AM
2024-06-18 09:51:15 (UTC+0)
In
T299947#9902127
@Physikerwelt
wrote:
I am running MW 1_43-wmf9 and started running update.php yesterday night. It seems that the updater is very slow (running migrateLinksTable on pagelinks). I now started to run this migration in a separate process. Do you have any idea how long the migration might take? My first estimation is 12 days for our 12M pages. This seems very slow, does that make sense?
It depends on how connected those pages are (=size of pagelinks table). 12 days for 12M sounds about right. You could potentially increase the batch size if they get connected to one set of popular pages mostly.
Physikerwelt
added a comment.
Edited
Jun 18 2024, 7:16 PM
2024-06-18 19:16:25 (UTC+0)
Thank you. This is very helpful. I think one of the problems is that we are also writing to the wiki simultaneously.
MariaDB [(none)]> SELECT table_name AS `Table`, round(((data_length + index_length) / 1024 / 1024 / 1024), 2) `Size in GB` FROM information_schema.TABLES where table_name like '%links' order by (data_length + index_length) desc;
+---------------+------------+
| Table         | Size in GB |
+---------------+------------+
| pagelinks     |      22.43 |
| externallinks |       9.92 |
| templatelinks |       7.45 |
| categorylinks |       0.09 |
| iwlinks       |       0.01 |
+---------------+------------+
And it is a bit faster than originally expected. So maybe only 6 days. (Edit: It was actually much faster: real 1510m58.783s)
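To put the reported `real 1510m58.783s` next to the earlier 12-day and 6-day estimates, here is a minimal conversion sketch in Python. The helper name `parse_real_time` is hypothetical, and it assumes the value is the usual minutes-and-seconds format printed by the shell's `time` command.

```python
import re

def parse_real_time(text: str) -> float:
    """Convert a duration such as '1510m58.783s' into seconds."""
    match = re.fullmatch(r"(\d+)m(\d+(?:\.\d+)?)s", text)
    if match is None:
        raise ValueError(f"unexpected duration format: {text!r}")
    minutes, seconds = match.groups()
    return int(minutes) * 60 + float(seconds)

total_seconds = parse_real_time("1510m58.783s")
hours = total_seconds / 3600
days = hours / 24

print(f"{hours:.1f} hours, or about {days:.2f} days")
```

Under that assumption the run took a little over 25 hours, roughly one day instead of the estimated 6 to 12.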
gerritbot
added a comment.
Jun 18 2024, 8:28 PM
2024-06-18 20:28:10 (UTC+0)
Change #1043756
merged
by jenkins-bot:
[mediawiki/core@master] schema: Drop old pagelinks columns
Maintenance_bot
removed a project:
Patch-For-Review
Jun 18 2024, 8:30 PM
2024-06-18 20:30:48 (UTC+0)
ReleaseTaggerBot
edited projects, added
MW-1.43-notes (1.43.0-wmf.11; 2024-06-25)
; removed
MW-1.43-notes (1.43.0-wmf.3; 2024-04-30)
Jun 18 2024, 9:00 PM
2024-06-18 21:00:31 (UTC+0)
thiemowmde
mentioned this in
T367940: Beta cluster fails with "LinksMigration config …WRITE_OLD on pagelinks … not supported anymore"
Jun 19 2024, 7:27 AM
2024-06-19 07:27:59 (UTC+0)
CodeReviewBot
added a project:
Patch-For-Review
Jul 6 2024, 6:01 PM
2024-07-06 18:01:43 (UTC+0)
tacsipacsi opened
Update pagelinks access
Tacsipacsi
subscribed.
Jul 6 2024, 6:14 PM
2024-07-06 18:14:36 (UTC+0)
I didn’t want to spam here… It looks like @CodeReviewBot is smarter than I thought. Sorry!
Physikerwelt
unsubscribed.
Jul 6 2024, 6:23 PM
2024-07-06 18:23:12 (UTC+0)
Ladsgroup
closed this task as
Resolved
Jul 28 2024, 10:10 PM
2024-07-28 22:10:52 (UTC+0)
Ladsgroup
claimed this task.
Ladsgroup
closed subtask
T352010: Gradually drop old pagelinks columns
as
Resolved
Ladsgroup
edited projects, added
Data-Persistence
; removed
Patch-For-Review
Ladsgroup
moved this task from
Inbox
to
Epic - Database
on the
Data-Persistence
board.
Maintenance_bot
edited projects, added
User-notice-archive
; removed
User-notice
Aug 7 2024, 10:30 PM
2024-08-07 22:30:29 (UTC+0)
CodeReviewBot
added a comment.
Aug 27 2024, 12:28 AM
2024-08-27 00:28:00 (UTC+0)
legoktm merged
Update pagelinks access
Physikerwelt
mentioned this in
T378762: Edits seem to be much slower in REL1_43
Oct 31 2024, 7:58 PM
2024-10-31 19:58:01 (UTC+0)
MusikAnimal
mentioned this in
rXTRMRB573042a7357a: PageRepository: update query following pagelinks normalization
Mar 20 2025, 3:18 AM
2025-03-20 03:18:09 (UTC+0)
gerritbot
added a comment.
Jun 15 2025, 5:09 PM
2025-06-15 17:09:56 (UTC+0)
Change #1158639 had a related patch set uploaded (by Zabe; author: Zabe):
[mediawiki/core@master] Remove pagelinks migration config
gerritbot
added a project:
Patch-For-Review
Jun 15 2025, 5:09 PM
2025-06-15 17:09:57 (UTC+0)
gerritbot
added a comment.
Jun 15 2025, 8:17 PM
2025-06-15 20:17:03 (UTC+0)
Change #1158639
merged
by jenkins-bot:
[mediawiki/core@master] Remove pagelinks migration config
gerritbot
added a comment.
Jun 15 2025, 8:54 PM
2025-06-15 20:54:41 (UTC+0)
Change #1158804 had a related patch set uploaded (by Zabe; author: Zabe):
[operations/mediawiki-config@master] Stop setting $wgPageLinksSchemaMigrationStage
ReleaseTaggerBot
added a project:
MW-1.45-notes (1.45.0-wmf.6; 2025-06-17)
Jun 15 2025, 9:01 PM
2025-06-15 21:01:06 (UTC+0)
gerritbot
added a comment.
Jun 16 2025, 9:44 AM
2025-06-16 09:44:24 (UTC+0)
Change #1158804
merged
by jenkins-bot:
[operations/mediawiki-config@master] Stop setting $wgPageLinksSchemaMigrationStage
Stashbot
added a comment.
Jun 16 2025, 9:45 AM
2025-06-16 09:45:08 (UTC+0)
Mentioned in SAL (#wikimedia-operations)
[2025-06-16T09:45:07Z]
T183490
)]], [[gerrit:1158804|Stop setting $wgPageLinksSchemaMigrationStage (
T299947
)]]
Stashbot
mentioned this in
T183490: MCR schema migration stage 4: Migrate External Store URLs (wmf production)
Jun 16 2025, 9:45 AM
2025-06-16 09:45:09 (UTC+0)
Mentioned in SAL (#wikimedia-operations)
[2025-06-16T09:47:00Z]
T183490
)]], [[gerrit:1158804|Stop setting $wgPageLinksSchemaMigrationStage (
T299947
)]] synced to the testservers (see
). Changes can now be verified there.
Stashbot
added a comment.
Jun 16 2025, 9:57 AM
2025-06-16 09:57:54 (UTC+0)
Mentioned in SAL (#wikimedia-operations)
[2025-06-16T09:57:54Z]
T183490
)]], [[gerrit:1158804|Stop setting $wgPageLinksSchemaMigrationStage (
T299947
)]] (duration: 12m 46s)
Elcapitan68
created subtask
T419479: Upgrading pagelinks table makes pl_target_id NOT NULL, creating from scratch doesn’t
Mar 9 2026, 9:23 PM
2026-03-09 21:23:18 (UTC+0)
Elcapitan68
mentioned this in
T419479: Upgrading pagelinks table makes pl_target_id NOT NULL, creating from scratch doesn’t
Zabe
closed subtask
T419479: Upgrading pagelinks table makes pl_target_id NOT NULL, creating from scratch doesn’t
as
Resolved
Mar 19 2026, 11:47 PM
2026-03-19 23:47:54 (UTC+0)
Content licensed under Creative Commons Attribution-ShareAlike (CC BY-SA) 4.0 unless otherwise noted; code licensed under GNU General Public License (GPL) 2.0 or later and other open source licenses. By using this site, you agree to the Terms of Use, Privacy Policy, and Code of Conduct.