Rothgar has put forth a possible server merge scenario for folks on the Lucan D’Lere server. This is not a merger that would be taken lightly, as Lucan D’Lere is a Role-play server and there is currently no server that its members could merge to that wouldn’t result in an overpopulated server. It will be interesting to read player reaction to the idea of merging with a non-RP server.

From the EQ2 Forums:

We are still considering a merger for LDL and currently Crushbone would be the ideal candidate due to the resulting population numbers. Crushbone was one of the highest population servers prior to the merges, but now its population is just a little under the rest of the servers. It still has a healthy population on its own, but the numbers make sense to look at a possible LDL / Crushbone merge.

A combined Crushbone / LDL would have a population similar to Antonia Bayle and I think that would be a really good thing for both servers.

 

Things are moving fast in the Server Merge process, and even we are having trouble keeping up. From Rothgar:

We plan to have all U.S. merges done by Monday night, the 20th. International merges will come after the first of the year due to database maintenance that must be performed before we can start the merges.

Wednesday night at midnight we will perform two more mergers. Then Sunday night at midnight we will perform the final two U.S. mergers.

I’ll post the server names sometime tomorrow with the corresponding day that we are planning to do the merge.

Rothgar also provided more insight into how the Server Merge decisions were made:

The servers that are merging had very similar population numbers to begin with but I took the highest server and merged it with the lowest, and so forth. This was done to try to normalize the final populations as much as possible.

Also, I was going based on peak concurrent population, not the number of accounts or characters which would have skewed things towards the older servers.

In the end, all servers that were merged should be very similar in population to the new Everfrost.

Also, regarding the post about [Lucan D'Lere]… I’ve mentioned it a few times on here that we will look at the remaining servers as soon as the first set of merges are done and make a decision at that point. There’s nothing else I can really say about it beyond that. We have the new merge process that should be pretty flawless by the time we’re done and our goal is for all servers to have a healthy population. Obviously some servers are limited by contracts and rulesets which make it a little tougher. Fortunately the issue with LDL is simpler than with Live Gamer servers or PvP servers. But, we still need to decide what type of server to merge LDL with. I don’t think its possible at this time to merge them with AB without making some adjustments on the back end.

As Lucan D’Lere is a Roleplay server and the only other RP server is Antonia Bayle, this is a tricky decision.
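For anyone curious how the "highest with lowest" pairing Rothgar describes above plays out, here is a minimal sketch. It assumes each server is ranked only by peak concurrent population, as he says; the function name, server names and numbers are all made up for illustration and are not SOE's actual tooling or data.

```python
def pair_servers_for_merge(peak_concurrent):
    """Pair the highest-population server with the lowest, the second
    highest with the second lowest, and so on.

    peak_concurrent maps a server name to its peak concurrent population.
    Returns a list of (larger, smaller, combined_population) tuples.
    With an odd number of servers, the middle one is left unmerged.
    """
    # Rank servers from highest to lowest peak concurrent population.
    ranked = sorted(peak_concurrent, key=peak_concurrent.get, reverse=True)

    pairs = []
    top, bottom = 0, len(ranked) - 1
    while top < bottom:
        big, small = ranked[top], ranked[bottom]
        pairs.append((big, small, peak_concurrent[big] + peak_concurrent[small]))
        top += 1
        bottom -= 1
    return pairs


# Hypothetical servers and populations, purely for illustration.
example = {"Server A": 1200, "Server B": 1100, "Server C": 1000, "Server D": 900}
for big, small, combined in pair_servers_for_merge(example):
    print(f"{big} + {small} -> {combined}")
```

In this made-up example both merged servers end up at the same combined population, which is the "normalizing" effect Rothgar was going for.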

 

Rothgar tagged along with Dracos Argent last night during our raid to look at the lag that has been plaguing many of Butcherblock's raid and heroic instances.

While we were fighting Waansu, he was able to diagnose the lag after seeing a 6-second lag spike during one of our pulls. After doing some research, he found that Butcherblock instances were being started on servers that were already heavily loaded, due to an issue with the load balancing server.

Rothgar explained the following during our raid:

Rothgar Shouts, “It appears that all of butcherblocks instanced content is being routed to an overloaded dynamic cluster, but all of the overland zones seem to be on the standard cluster.”

“I have looked at many butcherblock instances and they all are running on the same overloaded cluster while the normal cluster has available processing power. So it’s definitely an issue with the world manager balancing zones in the wrong place.”
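To illustrate what "balancing zones in the right place" might look like, here is a generic sketch of least-loaded placement. This is not how the EQ2 world manager actually works (we have no visibility into that); the cluster names, the `running` and `capacity` fields and the numbers are all assumptions for the example.

```python
def place_instance(clusters):
    """Start a new instance on the cluster with the lowest utilization.

    clusters maps a cluster name to a dict with hypothetical fields:
      "running"  - instances currently running on the cluster
      "capacity" - instances the cluster can run comfortably
    Returns the name of the chosen cluster and records the new instance.
    """
    # Utilization = running / capacity; pick the least busy cluster.
    chosen = min(
        clusters,
        key=lambda name: clusters[name]["running"] / clusters[name]["capacity"],
    )
    clusters[chosen]["running"] += 1
    return chosen


# Hypothetical state resembling the problem described above: one cluster
# is badly overloaded while the other still has room.
clusters = {
    "dynamic": {"running": 16, "capacity": 4},
    "standard": {"running": 2, "capacity": 4},
}
print(place_instance(clusters))  # the new instance goes to "standard"
```

The bug Rothgar describes amounts to the opposite behavior: new Butcherblock instances kept landing on the busy cluster even though another one had spare processing power.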

Later he posted on the official forums the following:

The fix is complete and I’m seeing instances for Lucan and Butcherblock being properly load-balanced now.  Any existing instances will need to shut down and be restarted for them to get moved to less-busy servers.  New instances will be starting up just fine and should enjoy a somewhat lag-free environment.

It generally takes an instance about 20 minutes to shut down after everyone leaves so if your zone is currently unplayable and don’t mind a 20 minute break, you should be able to resume shortly.

We’re very sorry for the poor performance this undoubtedly caused over the weekend for BB and LDL players.  I’ll see if I can put Brenlo on the spot for some bonus XP.

Unfortunately, he followed his post in the forums with this:

Sorry to be the bearer of bad news, but this issue was something that started on Friday.  We’re aware that there are still lag issues and I by no means meant to imply that this last fix “fixed lag”.  There was a definite problem with load balancing that was making the problem much worse and that has been fixed.  You’ll probably still see lag issues that were there prior to Friday and we are working on tracking those down.

http://forums.station.sony.com/eq2/posts/list.m?start=75&topic_id=476210

I would like to thank Rothgar for taking the time to come to our raid, see the lag issue firsthand, and put the fix in so we can raid again!

I hope the remaining lag issues get resolved as quickly as possible, such as the long zone times, the periodic broker lag and, worst of all, the lag while fighting contested mobs in overland zones.

This is a PM from Rothgar to a Dracos Argent member with some insight into the server system architecture:

We actually do have tools in place to monitor server health. It’s a pretty amazing tool and would be fun to show it to you and what all it can do. We also have people monitoring server status 24 hours a day so you’d think they would be made aware of this type of issue. Unfortunately with so many games to monitor and so many different types of health conditions to watch for, we have to create specific alerts to notify the Operations department when something is outside of ideal working parameters. We have many alerts created for things but in this particular instance, there was no alert that would have caught the problem. The first thing that came to mind when I saw the issue was “why don’t we alert on number of processes per machine?” So today you can bet I’ll be working with Operations to get that alert created so this won’t happen in the future. I’m sure there are other situations out there that we could be watching for as well, but its difficult to imagine some of them until they happen. This particular problem was a new one that had never occurred before. Ideally the quad-core servers should be running around 4 processes per machine so they each have their own dedicated CPU. But in this case some machines were running as many as 16 processes. It was pretty insane and no wonder why those processes were starved for CPU cycles.
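The "number of processes per machine" alert Rothgar mentions could be as simple as the check sketched below. The threshold of 4 comes from his description of the quad-core servers and the count of 16 from the overloaded machines he saw; the function name and host names are hypothetical, and SOE's real monitoring tool is certainly more involved.

```python
def overloaded_machines(process_counts, cores_per_machine=4):
    """Return the machines running more zone processes than they have cores.

    process_counts maps a host name to its number of running processes.
    The default of 4 reflects the quad-core servers described above, where
    each process should ideally have its own dedicated CPU.
    """
    return {
        host: count
        for host, count in process_counts.items()
        if count > cores_per_machine
    }


# Hypothetical hosts: one at the ideal 4 processes, one at the 16 seen
# during the Butcherblock lag.
counts = {"zone-host-01": 4, "zone-host-02": 16}
for host, count in overloaded_machines(counts).items():
    print(f"ALERT: {host} is running {count} processes")
```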

 

The latest news

Update:

The downtime for Nagafen, Butcherblock, and LDL has been extended to around 10pm PDT this evening.

We apologize for the extended downtime.  As a result of this, the team has already extended the Moonlight Enchantments event an extra day to make up for all the time that these servers will be missing.

Thank you again for your continued patience.

This brings the estimated downtime to 22 hours.

 

Due to two mistakes in how the Server Downtime scripts were triggered last night, all West Coast servers must be rebooted. This will happen in a few minutes. The issue:

  • All West Coast servers, not just Butcherblock, Lucan D’Lere, and Nagafen, were set to stop allowing instances (dungeons) from being created starting at noon today.

Announcement by Rothgar:

I just wanted to take a moment to explain some of the broadcasts you may have seen and what is happening with the servers.

Last night, prior to the maintenance for Butcherblock, Nagafen and Lucan D’lere, we scheduled the downtime which is responsible for notifying the worlds via broadcast and also safely shutdown and lock persistent instances until the worlds come down.

There were two mistakes made.  This notice was scheduled for all west coast servers, not just the three that needed it.  We also mistakenly set the time for today at noon instead of last night at midnight.  This is why BB, LDL and Nagafen didn’t get the notice last night.

Unfortunately once the servers have gone into maintenance mode and the instances become locked, there is no way to reset it unless we reboot them.

So we are planning on taking all of the west-coast servers down in 5-10 minutes for a reboot.  This will clear them so you’ll be able to create instances once again.

The reboot shouldn’t take long, maybe 10-20 minutes at most.  We apologize for this confusion.  Unfortunately some processes still require human interaction and occasionally we can make a mistake like this.

We hope to have all servers except for BB, LDL and Nagafen back up very shortly.  The DB migration is going well.  Nagafen is the only remaining DB to be copied and we’re hoping that it will complete in a few hours.  After the DB has been moved, we will need some additional time for QA to verify the migration.  Once we have the green light from QA, the remaining 3 servers will come back up.

Thanks for your patience!

 

© 2011 EverQuest 2 Game Update Blog Suffusion theme by Sayontan Sinha