Despite the preparations, everything went just a bit differently than planned. In the end there were really only three problems that had an impact.
We started somewhat later than planned because a backup overran. Better safe than sorry. While re-plugging the cables, it turned out that the cabinet holding the equipment stood with one foot exactly on the floor tile I had to take out. To manage that properly, extra tiles around the cabinet had to be removed so that the tile I actually needed came free. Not a big deal in itself, but...
With all that building and fiddling, one cable was annoyingly in my way. It hung there, not tucked away, being a nuisance. Since everything was down anyway, it seemed a good moment to replace the cable with another one and tuck it away neatly. Dug up a cable, plugged it in, tucked it away nicely and carried on.
And then... well, it suddenly turned out that the odd pink colour of that cable was there for a reason. It took a while before I realised that the cable was a crossover cable. With the old cable back in place, everything worked again (from 11:51).
At least... Somehow, copying the data did not go as fast as we had thought. Where a bit of speed was needed the copy turned out to be slow, and where we had counted on a few hours of pumping data it quickly became a good 40 hours (so somewhere on Monday morning). So we switched to a fallback plan, and we will finish the last steps next week.
And at the very end (I was nearly on to security guard number 3; they kept being relieved) it turned out that the entire printing system was down. The print server had taken a knock, and since our printers pull their configuration from the print server, that went wrong. After a short phone call with one of the local guys who had dealt with this sort of thing before, we decided to take the whole printer environment down once and then start it up again. Slowly (say 5-10 minutes per printer, and we have almost 20 of them) we got them working and printing again, one by one.
So the estimated end time of 1800 hrs. was somewhat overrun…
For those interested, below is the full listing (anonymised and stripped of specific technical info) of a whole day of wonderfully good teamwork on one site, with 4 countries (the Netherlands, Poland, Belgium, Italy), and banter both serious and less serious.
A long day, but done with pleasure. And… if all went well, the users won't notice a thing.
The players below: Italian (Ig-1 in the conversations): one of the project managers, working from Italy. Polish-1, Polish-2 and Polish-3 (Pg-1, Pg-2 and Pg-3 respectively in the conversations): the Polish guys in the Polish data centre, who work with the VMware machines, Unix storage and backup environment. C, K and A are testers/advisers who were on call for us.
8:26 [ Italian ] Morning Den
8:26 [ Polish-1 ] Hi Den!
8:26 [ Polish-2 ] Hi Den
8:27 [ DenBolle ] Goodmorning everyone!
[Ig-1] is generous : he treats us all in
coffee!
8:27 [ Italian ] yes this for [Pg-2]
for [Pg-1]
8:28 [ Polish-1 ] thx
8:28 [ Italian ] for Den that has to wake up
8:28 [ DenBolle ] Gracie!
8:28 [ Italian ] yw
8:29 [ Polish-2 ] Thank you!
9:00 [ Italian ] Hi [Pg-3]
9:00 [ Polish-3 ] Hello
9:00 [ Italian ] please advise us when done
9:00 [ Polish-3 ] around 10 minutes left I think
9:01 [ DenBolle ] ah ok
9:01 [ Polish-3 ] blowing from home will not speed it up,
sorry
9:02 [ Polish-2 ] ;-)
9:02 [ Italian ] 9:02
9:02 [ DenBolle ] I'm in the office, I could spin up the
tape?
9:02 [ Italian ] good idea
9:02 [ DenBolle ] use a microwave to compress the data on
the tape?
9:03 [ Polish-3 ] Den, you are the biggest of us /except
[Pg-1]/ could you tell few words to the
Adic Lib to write a little bit faster ?
9:04 [ DenBolle ] sure..
[walks into server room with a stick and
blowtorch]
9:04 [ Polish-3 ] ;-)
9:04 [ Italian ] roftl
better ..... grin grin
9:06 [ DenBolle ] ok, I've discussed it with the lib, he's
appologizing and will work harder.
btw I never realized that they actually
can scream if you put them on fire.
9:06 [ Polish-3 ] buahahahaha
9:08 [ Polish-3 ] I though Adic is a woman..strange world..
9:10 [ DenBolle ] It screamed like a woman though..
9:14 [ Polish-3 ] btw started to write system state...
she is coming or he...
9:15 [ DenBolle ] she is coming.. this story get's a nice twist.
Go on [Pg-3]
9:16 [ Polish-3 ] heheh who will make some pictures - ->Den?
It is done gents!
9:16 [ DenBolle ] and put them on a site " www.hornytapelibs.com" ?
ok, let go on then
9:16 [ Polish-3 ] yep
9:17 [ Polish-2 ] it looks like I've missed some nice stories here
ok, I'm proceeding with stopping the BU1SMLXV0001
9:18 [ Polish-3 ] well..it will not be published on our webside
9:18 [ Polish-1 ] Can I start stopping VMs now?
9:18 [ Polish-3 ] logged off
9:19 [ DenBolle ] ok, I'm over to wireless to be safe.. just a se
9:19 [ DenBolle ] and back
9:20 [ Polish-3 ] [Pg-1] could you call me when the servers will be
back - I will start backups - I mean afternoon
9:20 [ Polish-1 ] Sure.
9:20 [ Polish-3 ] 504 110 321
cool,
9:31 [ DenBolle ] is this working
9:31 [ Polish-2 ] yep, now it's working
9:31 [ DenBolle ] mhm. glitch..
what did I miss?
9:31 [ Italian ] ;-)
9:31 [ Polish-1 ] I know. Some functional accounts logged
on the console. I need to power them down
manually
9:32 [ DenBolle ] last remark i've seen is froim [Pg-3] :
9:32 [ Polish-3 ] 504 110 321
cool,
9:35 [ Polish-2 ] I can't paste the OCS chat, because it
says that it is too large..
So to summarize it quickly: [Pg-3] logged off,
because his wife was chasing him. I've stopped
the BU1SMLXV0001 and [Pg-1] is stopping now
other VMs and ESX hosts.
9:36 [ Italian ] All Vms stopped ...great job [Pg-1]
9:36 [ DenBolle ] no problem.. So no strange things yet and
all goes as planned. Besides the huntdown of
[Pg-3]'s wife.
9:36 [ Italian ] ;-)
9:36 [ Polish-2 ] Maybe she heard the library's screams...
9:37 [ DenBolle ] tsk.. women united... If you hurt one ...
9:37 [ Polish-2 ] ;-))
9:38 [ Italian ] when finished I'll copy this chat and send
to Manager1, Manager2 and Manager3
with title "drunk men at work"
9:38 [ DenBolle ] grin grin.. and we never are allowed to
work on saterday..
9:44 [ Polish-1 ] @all: All VMs are powered down now. All
ESX hosts are shutted down and powered off now.
9:44 [ DenBolle ] ok, thx
9:44 [ Italian ] Den ..... go there and do your best
9:45 [ Polish-2 ] no no
now it's my part
please hold on
9:45 [ DenBolle ] [stopping]
9:45 [ Italian ] sorrrrrrryyyyyy
9:45 [ Polish-2 ] Den, I'll let you know
It will take approximately 15 mins
9:45 [ DenBolle ] I'll be waiting for you
[not to self : this chat's getting stranger
by the sentence]
10:01 [ DenBolle ] I've got screaming equipment
10:01 [ Polish-2 ] OK, all FC switches and FAS270`s are going down
or are already down.
This is the reason of those screams.
10:02 [ DenBolle ] ok, let me check in the room when they actually
die..
10:02 [ Polish-2 ] Let's wait now 2-3 min, just to be sure that all
went down.
10:03 [ DenBolle ] sure
10:03 [ Polish-2 ] After this 2-3 mins, I'd like to ask you, Den,
to power off the following devices:
BU2DSBS001
BU2DSBS002
BU1DSBS001
BU1DSBS002
BU2DSNTP001
BU2DSNTP002
BU1DSNTP001
BU1DSNTP002
so - all 4 FAS270's and all 4 FC switches.
10:04 [ DenBolle ] I'll do. I'll start at 10.05
10:05 [ Polish-2 ] ok
10:08 [ DenBolle ] ok, all shelfs are down (no more hd activity) :
I'll go shutdown the devices.
Do you want me to reconnect the fibers straight
away?
10:09 [ Polish-2 ] Did you power off the additional shelves as well?
10:09 [ DenBolle ] no, thats going to mbe done now
10:10 [ Polish-2 ] To be honest - it's probably not needed.
10:10 [ DenBolle ] if you feel safer when I do: no prblem.
10:10 [ Polish-2 ] OK - If you want to do so, we can do it. Just to
remember: later, when we will be powering it on,
we need to start them in reverse order.
So first of all: the additional disk shelves,
after that 2-3 min break to let the disks spin up
and afterwards the FAS270 controllerrs
10:12 [ DenBolle ] ok, I'm in, back in 10 -15 mins
10:12 [ Polish-2 ] one additional thing
10:12 [ DenBolle ] yes
10:12 [ Polish-2 ] I've set new IPs for BU2DSNTP001 & BU2DSNTP002.
So please reconnect also their ethernet cable to
the new subnet.
..and you have to reconnect the FC cable as well
as we discussed on the diagram.
10:13 [ DenBolle ] I'll do, stay tuned
10:13 [ Polish-2 ] th
thx
10:40 [ DenBolle ] ok, almost done, just one issue.
the 4 utp cables for the 270's : the core switch
is full, give me 20 mins to isolate cables we can
disconnect
10:41 [ Polish-2 ] 4 utp cables?
10:42 [ DenBolle ] yes the ethernet cables
10:42 [ Polish-2 ] to the BU2DSNTP001 & BU2DSNTP002 4 cables are
connected ?
10:42 [ DenBolle ] yes
10:42 [ Polish-2 ] they have dual ethernet ports, however only one port
(e0a) is used
10:43 [ DenBolle ] ok, in that case I'm still one port short : got one
emtpy one..
10:43 [ Polish-2 ] ok, I see
ok, we will wait
when you finish
you can power on following devices:
BU1DSBS001
BU1DSBS002
BU2DSNTP001
BU2DSNTP002
BU1DSNTP001
BU1DSNTP002
and please do not power on ESX servers
10:44 [ DenBolle ] No, I;ll do. ok, [Ig-1], in that case could you
serve all once more a cup of coffe in the mean
time?
10:46 [ Italian ] here they are ......
I'm boiling water
.........
and ..... that's it
11:03 [ DenBolle ] ok, done with the cables, powered up shelfs
(not FAS270's), and BU1DSB001 and BU1DSB002 are
starting. When these are finshed I'll boot
BU2NSNTP001/2 and BU1DSNTP001/2
11:03 [ Polish-2 ] great news
11:03 [ Italian ] [happy]
11:03 [ DenBolle ] at .05 I'll start
BU2NSNTP001/2 and BU1DSNTP001/2
11:04 [ Polish-2 ] FC switches haven't started yet.
11:04 [ DenBolle ] ok, when they have, ping me, then I'll boot the
FAS270's
11:04 [ Polish-2 ] ok
Still no progress.. usually, they are starting
quite fast.
11:06 [ DenBolle ] mhm.. what Can I check now?
11:07 [ Polish-2 ] Did you do the power-cycle of those switches?
I mean - were they powered off?
11:07 [ DenBolle ] they were
and disconnected power cables / reconnected
when I was done.
11:07 [ Polish-2 ] ok let's wait additional 1-2 mins
11:07 [ DenBolle ] ok
and if they aren't up.. that's were the terminal
connection kicks in?
11:09 [ Polish-2 ] I think that before that, we will do do the power
cycle once again.
11:09 [ DenBolle ] ok, I'll wait for your instrcutions
11:09 [ Polish-2 ] terminal connection was meant for BU2DSNTP001 & 002
- if they would be problems after changing IPs
11:10 [ DenBolle ] ah.. I understand
11:11 [ Polish-2 ] this is really wird
weird*
11:11 [ DenBolle ] eh....
don't go there..
11:11 [ Polish-2 ] Den, could you check ping to any other device from
affected subnet (192.46.157.*)?
11:11 [ DenBolle ] sure, let me reconnect to wired network, I'll ping
you when I'm in
11:12 [ Polish-2 ] ok
11:14 [ DenBolle ] ah.. not the cleverest move..
no dhcp server
11:14 [ DenBolle ] let me fetch it in a other way'
11:14 [ Polish-2 ] ok
11:16 [ Italian ] even without dhcp
192.46.157.* switched should ping,
or am I wrong?
switches
11:17 [ Polish-2 ] yes, you're correct
I'm trying to investigate the problem starting from
their internal network.
11:17 [ DenBolle ] ok, isolated the problem..
pinging internally works..
11:18 [ Polish-2 ] to FC switches as well?
11:18 [ DenBolle ] pinging outside (eg. my own site) no go...
ips for those?
11:18 [ Polish-2 ] 192.46.157.133
f.e.
11:18 [ DenBolle ] jep, no problem...
11:19 [ Polish-2 ] ok, so we have some network problem..
11:19 [ DenBolle ] and default gateway also replies : .213
so let me rerun my cabling and I'll get
back in say 10 mins
11:19 [ Polish-2 ] ok, thx
11:30 [ DenBolle ] cables seems in order, the riverbed device seems to
have an issue. So I rebooted it, but it doesn't show
links... what IP's did we use? perhaps a msitake
and used a ip from the riverbed?
11:31 [ Polish-2 ] I've used following IPs for BU2DSNTP001 & 002:
192.46.157.166
192.46.157.167
However, they aren't even up.. So this is probably
not the issue.
11:31 [ DenBolle ] agree.
I can ping internally, no issues.
outside : doesn't work
how can we verify the correct function on the
riverbed?
11:33 [ Polish-2 ] Unfortunately, I am not a network specialist and do
not have any knowledge about that...
[Pg-1], [Ig-1] - any ideas?
11:33 [ Polish-1 ] No idea at all.
11:34 [ Italian ] I'm looking for someone from network team ......
11:34 [ DenBolle ] ok
11:34 [ Polish-2 ] from Poznan, they are all offline.
11:35 [ DenBolle ] wait just a second
to get the cable truogh I had to reconnect one cable
on the back of the riverbed, let me fetch the cabling
diagram
11:37 [ Polish-2 ] ok
11:50 [ Italian ] I don't understand
we don't touch any netw appliance
only FC switches ....
11:51 [ Polish-2 ] Den mentioned that he was doing reorganization
in LAN switches (and/or Riverbed device) -
this probably is causing problem
YES!!!
FC switch pingable!
11:51 [ Italian ] it works
Great job Den
!!!!
11:52 [ DenBolle ] pfoe..
my mistake!
11:52 [ Polish-2 ] :-))
11:52 [ DenBolle ] ok, back to business.. : I'll power up the FAs270?
11:53 [ Polish-2 ] yes, please
11:53 [ DenBolle ] ok, just a sec..
[notes to self to stay the hell away of those
cables.. ]
done, bootup in progress
11:54 [ Polish-2 ] great, I'm monitoring the.. ping to them.
let's hope that here we won't have problems...
11:54 [ DenBolle ] don't worry.. I know what went wrong..
network 101
11:54 [ Polish-2 ] hell yeah!
BU2dsntp001 and BU1dsntp001
up
11:55 [ Italian ] 2 good news in 2 mins .... my heart could stop
11:55 [ DenBolle ] har har..
we're a good team, it's dun to work like thois
this
dun = fun..
12:00 [ Polish-2 ] ok, the whole storage booted just fine..
now I'm going to make mess with it
12:00 [ Italian ] ok
12:02 [ DenBolle ] so at this point, we about 2,5 hrs behind on
schedule..
12:02 [ Italian ] yes
12:02 [ DenBolle ] sorry.
12:02 [ Italian ] np
How long [Pg-2]?
12:02 [ Polish-2 ] i think max. 30 min
12:03 [ Italian ] great
12:03 [ DenBolle ] So you're making up for my mistake, and then
we're back on schedule..
cool
12:03 [ Italian ] don't worry [Pg-2] even 1 hour
12:03 [ DenBolle ] no, I'll stay onsite as long as is needed.
12:29 [ Polish-2 ] OK - seems to be done.
LUNs from BU2 storage are remapped to the new ESX
hosts with IDs starting from 10.
Zoning on the FC switches is also reconfigured.
12:31 [ DenBolle ] ok, so next steps are powering up ESX hosts?
12:31 [ Polish-2 ] [Pg-1] - I think that you can start ESX hosts
now. Please check the connection to the storage
before starting up the VMs.
12:31 [ Polish-1 ] OK. Please let me celebrate for a while this moment
when world's eyes are on me.
12:31 [ Polish-2 ] haha
12:32 [ DenBolle ] hahaha. Big aplause for you !
12:34 [ Polish-2 ] I've made it in 30 mins - I can afford to eat
breakfast now.
12:34 [ Polish-1 ] I'll power them up one by one.
BU1sxve001 goes first
12:34 [ Polish-2 ] OK, I will be watching.. on the FC switches.
As per FC point of view - it's up. And each Netapp
can see it.
12:59 [ Polish-1 ] all 3 ESX host are up but they can't see new LUNs.
I performed rescan onstorage but it not helped.
I need to check this.
13:01 [ Polish-2 ] I can only say that from my side all seems to be
correct. Netapps see each ESX and those ESX are logged
in on Netapp adapters.
FYI: There should be 4 new LUNs visible,
with IDs 10,11,12,13.
13:03 [ DenBolle ] ok, np. take your time to find out whats going on.
14:05 [ Polish-1 ] OK. Now I can see all LUNs on all 3 ESX hosts.
@[Pg-2]: We need to verify paths now.
Please provide me paths for LUN S1V1 (108,5GB)
14:07 [ Polish-2 ] Sure thing.
You have to to tell me this LUN id.
Because we have now double pairs of S1V1,
S1V2, and so on...
14:08 [ Polish-1 ] LUNs from BU2echt are described as:
S1V1_BU2
S1V2_BU2 and so on
14:09 [ Polish-2 ] ok, so first the LUNs from BU1ST Netapps:
14:09 [ Polish-1 ] S1V1
14:10 [ Polish-2 ] S1V1 (880.1g, LUN id: 0)
S1V2 (805g, LUN id: 1)
S1V3 (563.1g, LUN id: 4)
Correct path for them is: 500a098195065a78
14:10 [ Polish-1 ] This should be set as prefferd right?
and active
14:11 [ Polish-2 ] S2V1 (955.1g, LUN id: 2)
S2V2 (730.1g, LUN id: 3)
S2V3 (563.1g, LUN id: 5)
Correct path for these is: 500a098185065a78
Correct. It should be considered as preferred
(primary) path.
BU2 LUNs:
S1V1 (840g, LUN id: 10)
S1V2 (840g, LUN id: 11)
Correct path: 500a098195364ca3
S2V1 (840g, LUN id: 12)
S2V2 (840g, LUN id: 13)
Correct path: 500a098185364ca3
Uff.. that's all.
14:14 [ Polish-1 ] ok please give me a few minutes to check all hosts
14:29 [ Polish-1 ] Ok. I verified everything and corrected paths.
Now I suggest to power on domain controller.
VM in BU1.
14:29 [ Italian ] ok for me
14:30 [ Polish-1 ] then I'll add to inventorythose machines which were
migrated.
14:31 [ DenBolle ] ok, well, so far all goes well.
14:32 [ Polish-1 ] Den, an all BU1st machines the second DNS entry is
set to the BU2 DC. I suggest to change it to
STVSXDMC018 147.184.54.199 and the primary DNS leave as
is (BU1SXDMC002).
14:33 [ DenBolle ] ok, do so, do you want me to submit a ticet for it?
14:33 [ Polish-1 ] no.
14:36 [ Italian ] [Pg-1], about BU2SDWN001 (FV) how long does it take
to add to inventory ?
14:36 [ Polish-1 ] it should be one click.
I'll do it now.
14:37 [ Italian ] thks
14:40 [ Italian ] I see it
14:43 [ Italian ] please remember to change IP
14:50 [ Polish-1 ] BU2SDWN001 is working under new IP now.
Now I will add BU2SXFILE001 to the inventory
and change the IP.
Then I'll call DNS team to perform DNS changes. OK?
14:51 [ DenBolle ] ok
14:51 [ Italian ] moment
DNS is already updated ?
14:51 [ Polish-1 ] no
14:51 [ Italian ] ok
go ahead
and I told C. to wait
for tests
14:51 [ Polish-1 ] ok
15:23 [ Polish-1 ] I powered on BU2SXFILE001 and changed IP. Then I added
a new disk to BU1SXFILE001 and powered it up.
Now I catch up with network team for DNS changes.
15:23 [ Italian ] ok
15:25 [ Italian ] ANy idea about new ip propagation time ?
15:29 [ Polish-1 ] [Pg-2]osz Ostrowski is workin on it now. He does not
know how log it may take as he's not doing it usually.
Only today was caought by me.
cought*
15:31 [ Italian ] because I suppose you already started DNS change
activity, from my experience it could take also
24 hours to propagate
I don't know if is possible to force it
15:31 [ Polish-1 ] I provided him numbers of those 2 requests.
15:33 [ DenBolle ] and a forced propagation isn't a option?
15:33 [ Italian ] Den What's your idea about FV tests, in the worst case
can we done them tomorrow ?
15:35 [ DenBolle ] I think we can, let me verifyt with C. on this one.
Its's possible to do it from home for him
15:39 [ Polish-1 ] So far from Poznan wi have this:
C:\>ping BU2sdwn001
Pinging BU2sdwn001.lan01.com [192.46.157.162] with
32 bytes of data:
Reply from 192.46.157.162: bytes=32 time=63ms TTL=119
Reply from 192.46.157.162: bytes=32 time=63ms TTL=119
Reply from 192.46.157.162: bytes=32 time=64ms TTL=119
Reply from 192.46.157.162: bytes=32 time=63ms TTL=119
Ping statistics for 192.46.157.162:
Packets: Sent = 4, Received = 4, Lost = 0 (0% loss),
Approximate round trip times in milli-seconds:
Minimum = 63ms, Maximum = 64ms, Average = 63ms
15:41 [ Italian ] the same from Italy ....
15:42 [ DenBolle ] I've talked with C., tomorrow testing is possible,
but before 1000 hrs, he has other apointments
15:42 [ Italian ] I think he caould start test now
DNS looks updated
15:43 [ DenBolle ] ok, I'll inform him.
15:43 [ Italian ] thks
15:43 [ Polish-1 ] I'm gonna start next VMS
ok?
15:43 [ Italian ] yes please
15:44 [ DenBolle ] C. wil start testing right away.
16:00 [ Polish-2 ] [Pg-1], are you starting the other VMs as well?
I need to check the BU1SMLXV0001 server.
16:01 [ DenBolle ] C. reports an issue : mq service isn't running,
I'm working on that.
16:02 [ Italian ] ok in case we can open a ticket to GSA-WW-MIDDLEWARE
16:07 [ DenBolle ] ok, problem killed.. C. tests further.
16:07 [ Italian ] good job "terminator"
16:16 [ Polish-1 ] I'm starting VMs one by one.
16:16 [ DenBolle ] ok, al seems in good working order ?
16:17 [ Polish-1 ] so far so good
16:17 [ Italian ] [happy]
16:17 [ DenBolle ] you optimistic guy
16:32 [ Polish-1 ] BU1SMLXV0001 is up now
16:32 [ Polish-2 ] thanks, I've just finished checking it
all seems to be fine, all filesystems mouted,
oracle processes started
By the way, after the action, we need to raise a high
priority ticket to DBA team with request to check
databases on affected servers. I discussed this
with [naam] from DBA, and he asked me to do so.
16:34 [ Italian ] good idea
16:35 [ Polish-2 ] How about other VMs? Did they started properly?
No issues yet?
16:35 [ DenBolle ] C. reports no issues on FV and MQ series, [Ig-1],
he'll sent you a test report.
16:35 [ Italian ] fantastic
16:36 [ Polish-1 ] I'm checking them one by one.
We'll need to ask [Pg-3] to check if backup works
fine now.
16:36 [ Italian ] [Pg-1] did you start data copy from BU2SXFILE001 ?
16:37 [ DenBolle ] should his wife be done with hunting now?
16:37 [ Polish-2 ] We'll see.. ;
16:37 [ Polish-1 ] not yet. I need to prepare jobs for SecureCopy.
Fortunatelly I installed it yesterday.
My wife?
16:38 [ Italian ] [Pg-3]'s wife
16:38 [ DenBolle ] No, [Pg-3]'s wife
16:38 [ Polish-1 ] Also Lotus team could be informed to check Domino
server.
16:38 [ Italian ] Den
Can I promote you leader?
16:39 [ DenBolle ] Sure, does it pay more?
16:39 [ Italian ] 65% more
16:39 [ DenBolle ] Give it to me!
16:39 [ Italian ] or 67% don't remember
I really have to go ........
but I'm available on call (if necessary , but i
don't think so, your are in good hands )
16:41 [ DenBolle ] No problem. I have your mobile.
[realises I'm the guy for the coffee now.. ]
16:42 [ Italian ] ehehehe
I'm happy that FV tests were ok
16:43 [ DenBolle ] me to, specially beacuse that's something automated
outside us, and BU2 depend on it..
16:43 [ Italian ] yes
Thanks a lot [Pg-1] and [Pg-2], I didn't do nothing
"today" but I wish you feel my support remotly
16:45 [ Polish-1 ] Thank you too.
16:45 [ DenBolle ] [Ig-1], thx sofar for your help and coordination.
When we're done I'll sent you a txt message to inform
you.
16:45 [ Polish-2 ] Sure, [Ig-1]. It was good to have you here. Thank you!
16:45 [ Italian ] too good guys
Thanks Den a call or a message will be really appriaciate
16:46 [ DenBolle ] I'll do, count on it.
16:46 [ Italian ] grazie
bye bye
16:46 [ Polish-2 ] bye
16:46 [ DenBolle ] bye
16:47 [ Polish-2 ] [Pg-1], are all VMs up by now?
If yes, I would raise ticket to DBA team.
16:48 [ DenBolle ] ok, just a short next-steps overvieuw : after al VMS are
up, db checks, inform notes team, and run a test backup.
Anything else?
16:49 [ Polish-2 ] from my side, nothing to add
16:49 [ DenBolle ] ok, [Pg-1]?
16:51 [ Polish-1 ] and copy BU2SXFILE001 to BU1SXFILE001
this can last few hours
16:52 [ DenBolle ] ok, and all would take about say 2-3 hrs from now?
(I've got to inform security here that it's going to
be a little later than 1800 hrs)
16:52 [ Polish-1 ] no idea. It's 500GB to copy.
16:53 [ DenBolle ] thats (500 diveded by 16M memory sticks)
say quite some later than 1800 hrs. I'll keep it
at 2000 hrs ok?
16:54 [ Italian ] [Pg-1] could you be there unil the end of the copy ?
16:54 [ Polish-1 ] ok. If it's not finished until that time you can go
home and I'll let you know via sms if all data is copied.
I'll be here until the end of the copy.
16:55 [ DenBolle ] No, I can stay as long as needed, but I would like to
inform the security guy before he's disapointed.
16:55 [ Italian ] because last step is to stop BU2SXFILE001
16:55 [ DenBolle ] /scared because [Ig-1]'s ghost is here..
16:55 [ Italian ] eheheheh
I'm waiting for my wife
16:56 [ Polish-1 ] I know but I have no info from K. if he checked if
we have access to every file. If I need correct
something manually it may last longer.
16:56 [ Italian ] she has to drive me home
16:57 [ DenBolle ] ok, for that lets see what happens when the copy
part is starting. And have a chat about that then.
First lets see that we have the DB's checked and
Domino team informed.
16:57 [ Polish-1 ] one moment. I have an urgent reboot in Canada.
16:58 [ Polish-2 ] Den, I am wondering whether you could raise a ticket
to DBA team when all VMs are up?
16:58 [ DenBolle ] sure, np, give me the RA and I'll submit one
16:58 [ Polish-1 ] in a minute or two
16:59 [ Italian ] TECH-CENTRE-DATABASE-SUPPORT
16:59 [ Polish-1 ] right
16:59 [ Polish-2 ] if this is a minute or two, I think that I will
manage to do that by myself.
17:00 [ Polish-1 ] ok
all VMs are up now
17:02 [ DenBolle ] allright, who's going to submit the ticket?
to prefent a double entry.
17:02 [ Polish-2 ] I am going to submit the ticket to DBA team.
17:02 [ DenBolle ] ok
17:05 [ Italian ] good luck and hope you finish asap
17:05 [ DenBolle ] Thx, and snjeoy your evening!
snjeoy = enjoy
17:06 [ Italian ] :-)
17:06 [ Polish-2 ] Ticket to DBA: UKIM20002972635
17:07 [ DenBolle ] ok, let see if they pcik it up quickly
17:08 [ Polish-1 ] I talked to [Pg-3]. He'll check backups about * PM
today and he'll send us an email with update.
17:08 [ DenBolle ] ok, so that's covered. What about Domino team?
17:08 [ Polish-1 ] I'm creating a ticket now
17:09 [ DenBolle ] Great, this is going well guys, thx a lot for
your effort on this one.
17:11 [ Polish-1 ] UKIM20002972653
17:11 [ DenBolle ] ok, thx
17:32 [ Polish-1 ] OK I started copying data from BU2SXFILE001
to BU1SXFILE001
1 min 200 MB
2 min 410 MB
3.73 MB/s
17:34 [ DenBolle ] mhm..
if I calculate right 200 mb / min = 1Gb/ 5 min
--> 12Gb / hrs..
17:35 [ Polish-1 ] looks like this
17:35 [ DenBolle ] ie.. whole copy will take approx 40 hrs..
and not ready monday morning..
17:36 [ Polish-1 ] 5 min and 1.16GB
now performance is about 4 MB/s
17:38 [ DenBolle ] A. (messaging team) tells that domino is working fine..
17:38 [ Polish-1 ] ok
17:39 [ Polish-2 ] No one from DBA haven't picked up the ticket yet.
I'll try to contact them now.
17:40 [ DenBolle ] ok.
[Pg-1] : lets wait a litte to see if it speeds up,
but 4mb/sec is to slow to get ready
17:41 [ Polish-1 ] ok now I have 3.61MB /s
and 2.17GB copied
we can leave BU2SXFILE001 working and correct DNS entry.
The migration can be done later if the server is
accessable for users after migration. What you think?
17:43 [ DenBolle ] like having a shadowcopy in place and then just a smal
migration?
17:44 [ Polish-1 ] migrate files during the week and during the weekend
copy only files which changed.
17:44 [ DenBolle ] yeah, that's the best approach I think,
sync the files, and switch in the next weekend.
how does that goes with the change we requested?
can we fit it in?
17:45 [ Polish-1 ] We can do an exception. it's always possible.
3.34MB/s now
17:46 [ DenBolle ] ok, so the chance that it will speed up significant
is low, let's go for the 'hidden sync' and migrate
next weekend then.
in this speed we're doomed
17:47 [ Polish-1 ] ok so I need to rise a ticket for a DNS entry change.
Then we will check if it works. I'll keep copying task
running during the weekend to have the main part
already copied.
A new change request will be required but I'll
prepare it.
17:48 [ DenBolle ] ok, and the DNS is only for the BU2SXFILE001 right?
17:48 [ Polish-1 ] right
this will point now to the new address
192.46.157.168 right/
17:49 [ DenBolle ] let me fetch that one, it...
17:56 [ DenBolle ] just a sec again.. I've got .164 ..
let me fect the documentation to [Ig-1]
17:56 [ Polish-1 ] ok
17:57 [ DenBolle ] The last IP address 192.46.157.168 has been required as
a temporary address in case of any issue on migrated VMs.
(Temporary emergency Vms component)
17:58 [ Polish-1 ] From changerequest:
3. 15 min. Modify the IP address and default
gateway of BU2SDWN001 as per
configuration items changes.
Change the IP of BU2SXFILE001 to
192.46.157.168 and apply BU1 network
settings.
To provide screen shot of new IP
config.
17:58 [ DenBolle ] yeah, it's ok. use it
17:59 [ Polish-1 ] ok so I'm rising a ticket for DNS change.
17:59 [ DenBolle ] wait..
17:59 [ Polish-1 ] ok
18:03 [ DenBolle ] mhm.. this one isn't clear. the doc from [Ig-1]
(23/12/2010) states .164 as FS ip.
and the 168 as emergency...
Go ahead: use .168: this is an emergency.
18:03 [ Polish-1 ] ok
18:04 [ DenBolle ] I'll update [Ig-1] on this per email.
18:08 [ Polish-1 ] 7002306
18:10 [ DenBolle ] ?
18:10 [ Polish-1 ] request number
to network for DNS change
18:10 [ DenBolle ] ah ok
[Pg-2], any luck on the DBA team?
18:13 [ Polish-2 ] not yet, I wrote a SMS to person on-duty,
without any luck yet
I'll wait additional 5-10 mins and I will call them
18:13 [ DenBolle ] ok, in that case I'll hand out the coffee
18:13 [ Polish-2 ] ok
18:14 [ DenBolle ] for [Pg-1], and this one for you [Pg-2]osz
18:14 [ Polish-2 ] thx a lot
18:14 [ DenBolle ] mail from [Pg-3] :
Hello All,
Just FYI. The backup service is OK after the migration.
I have just started all quarter jobs except BU1SXFILE001
that is during data copy. So far so good for the backup
side.
Regards,
[Pg-3]
18:14 [ Polish-2 ] good
18:26 [ Polish-2 ] weird.. the guy who is supposed to be on-duty from DBA
team is not answering neither the SMS and the phone calls..
18:26 [ DenBolle ] and a 'second' one isn't available?
18:27 [ Polish-2 ] I will try to call the 'second' one
18:31 [ Polish-1 ] 21,72Gb in 1 hour
18:31 [ DenBolle ] hey..
18:31 [ Polish-2 ] so it seems that speeded up?
18:32 [ DenBolle ] thats twice what we expected..
18:32 [ Polish-2 ] 25h for 500GB
18:32 [ DenBolle ] ie.. tomorrow al is done?
18:32 [ Polish-1 ] now 6.17 but we don't know how many small files are befor
us
18:32 [ DenBolle ] if no files have issues that is.
18:32 [ Polish-1 ] no it's about 26 hours
it's still risky
18:33 [ DenBolle ] lets keep the BU2SXFILE001 alive.. to be save.
18:33 [ Polish-1 ] yes
18:34 [ DenBolle ] and stick to the plan. A dns change is easy.
If all is save we can do it perhaps earlier then next
weekend.
18:36 [ Polish-1 ] sure
18:49 [ Polish-2 ] I've contacted [naam] from DBA at last..
He has muted his phone and left it in thie jacket's
pocket.. and didn't hear that it was ringing.
He will be on-line in a minute and check DBs.
18:50 [ DenBolle ] pfoe.. I assume we can recognize [naam] at his black
eyes monday?
19:00 [ Polish-1 ] C:\>ping BU2sxfile001
Pinging BU2sxfile001.lan01.com [192.46.157.168] with
32 bytes of data:
Reply from 192.46.157.168: bytes=32 time=95ms TTL=119
Reply from 192.46.157.168: bytes=32 time=92ms TTL=119
Reply from 192.46.157.168: bytes=32 time=77ms TTL=119
Reply from 192.46.157.168: bytes=32 time=75ms TTL=120
Ping statistics for 192.46.157.168:
Packets: Sent = 4, Received = 4, Lost = 0 (0% loss),
Approximate round trip times in milli-seconds:
Minimum = 75ms, Maximum = 95ms, Average = 84ms
C:\>
19:01 [ DenBolle ] great thx. Solved this one. After DBA verification we're
done for today ?
19:02 [ Polish-1 ] Please check if you can map shares without problems.
If yes, we can say that's all for you today.
19:07 [ DenBolle ] BU2sxfile001 resolves to 192.46.157.160 --> BU1sxfile001
when was the dns set?
19:08 [ Polish-1 ] I talked to this guy. This will be set in a minute.
let's wait for a while. They also have a tough day.
19:08 [ DenBolle ] sure, np..
19:23 [ DenBolle ] on the BU1SXFILE001 the BU2SXFILe001 resolves to
192.46.157.168, but on the network it doesn't,
I'll wait some more.
19:24 [ Polish-1 ] [Pg-2] is working on it now.
19:25 [ DenBolle ] sure, no hurries on this one
19:25 [ Polish-1 ] He had a call to Riga
network outage till monday
19:25 [ DenBolle ] in Riga?
19:25 [ Polish-1 ] yes. Packets get lost from time to time.
my remedy queue is blue
19:26 [ DenBolle ] because you're working all day allready ?
19:26 [ Polish-1 ] yes
19:27 [ DenBolle ] grin grin..
just keep up a little.. we're almost done for today
19:27 [ Polish-1 ] you are almost done
I'm till midnight
19:28 [ DenBolle ] wow.. thats 18 hrs ?
19:28 [ Polish-1 ] 17
19:28 [ DenBolle ] what happend?
19:28 [ Polish-1 ] when we are on remedy during weekend we have to proceed
all weekend's tasks that's all.
my weekend shift is between 12PM - 24
because today we had BU2migration I had to start earlier.
There was an option
KL...
19:29 [ DenBolle ] in that case : my compliments ! I appriciate it that
you supported us outside your regular shift times.
19:30 [ Polish-1 ] It's always better to have work in one heands that
pass it on and on from person to person.
19:31 [ DenBolle ] I know what you mean.. losing that one detail and then
no one knows what happened.
19:31 [ Polish-1 ] eg.
19:39 [ DenBolle ] allright : in the network the BU2sxfile001 in the
bu1 subnet works at 192.46.157.168 and mapping is
possible. I'm now going to the bu2 network to verify a login
19:40 [ Polish-2 ] A. just sent the info about DBs: all is fine.
In that case, I think that my work for today is done.
19:42 [ Polish-1 ] ok. thank you for your assistance. Have a nice sleep.
19:44 [ Polish-2 ] Thank you for your support. I wish you [Pg-1] a calm
on-duty time. I hope that you'll be able to relax tomorrow.
19:44 [ Polish-1 ] thanks, I hope so.
20:19 [ DenBolle ] ok, the followme system has issues. I've shut down
all printers and will restart them one by one
20:19 [ Polish-1 ] ok and now?
20:20 [ DenBolle ] I wait a few mins to calm the server.
It can't handle the amount of config requests,
and then I'll boot up the printers again
20:20 [ Polish-1 ] ok
20:22 [ Polish-2 ] OK, I'm logging off now.
@Den: Thanks for cooperation. I hope that all remaining
issues will be resolved soon.
Thank you guys, have a nice Sunday!
bye
20:22 [ Polish-1 ] bye! Thank you!
20:43 [ DenBolle ] [Pg-1], I need a reboot on the BU1SPWN001 ...
the priner server isn't acting wel
20:46 [ Polish-1 ] ok I'll do it right now.
20:46 [ DenBolle ] great
20:50 [ DenBolle ] and?
20:50 [ Polish-1 ] server is booting up now
20:50 [ DenBolle ] top, thx
20:53 [ DenBolle ] ah.. it's also the DHCP server...
20:53 [ Polish-1 ] yes
20:53 [ DenBolle ] lost my connection
I'm on wifi, is it back on ?
20:55 [ Polish-1 ] yes the server is back on
20:56 [ DenBolle ] ok, let me see that I'm getting the printer back on
20:56 [ Polish-1 ] ok
21:48 [ DenBolle ] ok, I'm back.. All printers tested and are working.
21:48 [ Polish-1 ] ok good.
what next?
21:48 [ DenBolle ] beer?
21:48 [ Polish-1 ] I'm at work
21:49 [ DenBolle ] I've got to drive the motor for 1,5 hrs...
then beer
We're done I think. Db's checked, mappings checked,
printers checked, all works, only thing now is
copying the data
21:50 [ Polish-1 ] 83Gb so far and 5.52 MB/s
21:50 [ DenBolle ] so we're done in the majority of task
21:50 [ Polish-1 ] I think yes. I'm happy it worked.
21:54 [ DenBolle ] me to.
Thx a lot for all. I'll inform the users.
Next weekend the last step ?
21:55 [ Polish-1 ] I think yes. I'll contact you in the week.
Thanks you. Have a nice end of weekend.
21:55 [ DenBolle ] ok, excellent. Thanks for your help and
patience. Hear you next weekend.
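
The copy estimate made in the chat at 17:34 (200 MB per minute, so roughly 12 GB per hour and about 40 hours for 500 GB) can be reproduced with a small sketch. This is only an illustration, not what was used on the day: the function names are made up, the rate is assumed to stay constant, and 1 GB is assumed to be 1000 MB. As noted in the chat at 18:32, a run of small files can still change the real rate.

```python
def rate_mb_per_s(copied_gb: float, elapsed_minutes: float) -> float:
    """Average throughput in MB/s from a progress sample (assumes 1 GB = 1000 MB)."""
    if elapsed_minutes <= 0:
        raise ValueError("elapsed_minutes must be positive")
    return copied_gb * 1000 / (elapsed_minutes * 60)


def eta_hours(total_gb: float, mb_per_s: float) -> float:
    """Hours needed to copy total_gb at a constant rate of mb_per_s."""
    if mb_per_s <= 0:
        raise ValueError("mb_per_s must be positive")
    return total_gb * 1000 / mb_per_s / 3600


if __name__ == "__main__":
    total_gb = 500  # size of the file server copy mentioned in the chat

    # Progress samples taken from the chat: (GB copied, minutes elapsed)
    samples = {
        "17:34 (200 MB after 1 min)": (0.2, 1),
        "18:31 (21.72 GB after 1 hour)": (21.72, 60),
    }
    for label, (copied_gb, minutes) in samples.items():
        rate = rate_mb_per_s(copied_gb, minutes)
        print(f"{label}: {rate:.2f} MB/s, about {eta_hours(total_gb, rate):.1f} h for {total_gb} GB")
```

With the first sample this gives a little under 42 hours, in line with the "approx 40 hrs" in the chat. The one-hour sample gives about 23 hours under the same assumptions, which is why keeping the old server alive and syncing during the week was the safer plan.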






Author DenBolle | Date: 23-1-2011 |


There were a few moments in there that make your palms sweat 🙂
Especially VMs with multipathing; then you really have to keep a cool head.
On the theme of: when you don't want it to, it does, and when you do want it to, it doesn't…
(Huh? Well, here it comes:)
We have four old ML350s (ProLiants) here, with all sorts of RAID combinations, 78 and 36 GB spread over different RAID sets on different servers. Those disks have to be wiped, so I gave them a quick 'shuffle'. (Everything out, mix them up, push them back in at random and let them initialise…) How it's possible I don't know, but three minutes later there I was, dumbfounded, looking at a Windows 2003 Server login screen.
A blue screen after logging in, of course, but still! (See, I do exciting things too 🙂 )