Welcome to the world of Unix, Virtualization, Backup and Storage!!!!!!!!!
Friday, 29 April 2011
solutions for solaris,backup and storage: DYNAMIC RECONFIGURATION IN E20k
solutions for solaris,backup and storage: DYNAMIC RECONFIGURATION IN E20k: "Today I learnt a very interesting thing,that i got a chance for Dynamic Reconfiguration in E20k.For these many days i have heard about board..."
DYNAMIC RECONFIGURATION IN E20k
Today I learnt a very interesting thing,that i got a chance for Dynamic Reconfiguration in E20k.For these many days i have heard about board replacement in E20k Oracle Solaris Enterprise Server.I was a bit worried as our soul of production was running on this box, more than that a single board of E20k costs Rs.60-70lakh.In case if any error happens...god its end of story.and now i don't know how i did that...
In our E20k box we have about 5 domains running with siebel Database taking 5 boards and 2 IO.All the domains are configured in Sun Cluster.I was asked to take a board from Siebel (SB6) and add it to another domain running SAP database.The board to be replaced was with 32 Gb
In this scenario:
C domain is Siebel
D domain is SAP.
Before Starting the activity i made a copy of all important outputs like df -h, ifconfig -a, /etc/vfstab, /etc/system, /etc/hosts, metadb, metastat, metaset -a, metainit 'metaset', scstat -g, scstat -q, scstat -i, scstat, scdidadm -L,scdidadm -l, scconf -pv, dfshares, prtconf -pvv, prtdiag -v format and almost all the outputs neccessary.
Below i have mentioned everything in a step by step method:
1)first i logged into domain siebel and checked the memory size with a 3rd party tool 'TOP'
root@PSBLD008 # top
[H [Jlast pid: 22624; load averages: 3.27, 3.32, 3.06 [1;149H04:12:07
693 processes: 685 sleeping, 2 zombie, 2 stopped, 4 on cpu
CPU states: % idle, % user, % kernel, % iowait, % swap
Memory: 96G real, 75G free, 15G swap in use, 102G swap free
PID USERNAME LWP PRI NICE SIZE RES STATE TIME CPU COMMAND
5648 dbmonito 46 0 2 12G 8296M cpu/130 64:13 1.73% oracle
19285 dbmonito 1 0 2 12G 8382M cpu/96 19:54 1.33% oracle
19283 dbmonito 1 0 2 12G 8382M cpu/133 19:10 1.22% oracle
19281 dbmonito 11 35 2 12G 8384M sleep 1:32 0.66% oracle
19279 dbmonito 11 35 2 12G 8382M sleep 1:14 0.57% oracle
23560 oracle 1 60 0 12G 8279M sleep 109:17 0.41% oracle
16271 oracle 1 60 0 12G 8281M sleep 51:05 0.21% oracle
21497 oracle 1 22 0 12G 8276M sleep 344:45 0.16% oracle
23987 oracle 1 60 0 12G 8283M sleep 34:20 0.13% oracle
22925 oracle 1 40 0 12G 8276M sleep 341:59 0.08% oracle
20787 oracle 1 43 0 12G 8277M sleep 257:22 0.07% oracle
22624 root 1 3 0 3472K 2432K cpu/160 0:00 0.06% top
18093 oracle 1 41 0 12G 8277M sleep 258:14 0.05% oracle
1651 oracle 1 40 0 12G 8278M sleep 193:11 0.04% oracle
22222 oracle 1 42 0 12G 8287M sleep 0:07 0.04% oracle
(you can check this using command prtdiag -v |grep -i memory, or any other command to check memory)
we can notice that total memory is 96 Gb.
-------------------------------------------------------
2)login to the console of E20k.the console is called SMS.and here u need to 'su' to sms-svc user to do sms operations,which is a user by default in console.
And check for all the platforms
root@T-Sky-20K-2-sc1 #su - sms-svc
T-Sky-20K-2-sc1:sms-svc:1>
T-Sky-20K-2-sc1:sms-svc:1>bash
bash-2.05$ showboards Retrieving board information. Please wait. ........ Location Pwr Type of Board Board Status Test Status Domain -------- --- ------------- ------------ ----------- ------ SB0 - Empty Slot Assigned - A SB1 On V3CPU Active Passed A SB2 On V3CPU Active Passed B SB3 On V3CPU Active Passed C SB4 On V3CPU Active Passed C SB5 On V3CPU Active Passed C SB6 On V3CPU Active Passed C SB7 On V3CPU Active Passed D SB8 On V3CPU Active Passed E SB9 - Empty Slot Available - Isolated SB10 - Empty Slot Available - Isolated SB11 - Empty Slot Available - Isolated SB12 - Empty Slot Available - Isolated SB13 - Empty Slot Available - Isolated SB14 - Empty Slot Available - Isolated SB15 - Empty Slot Available - Isolated SB16 - Empty Slot Available - Isolated SB17 - Empty Slot Available - Isolated IO0 On HPCI+ Active Passed A IO1 On HPCI+ Active Passed A IO2 On HPCI+ Active Passed B IO3 On HPCI+ Active Passed B IO4 On HPCI+ Active Passed C IO5 On HPCI+ Active Passed C IO6 On HPCI+ Assigned Failed D IO7 On HPCI+ Active Passed D IO8 On HPCI+ Active Passed E IO9 - Empty Slot Available - Isolated IO10 - Empty Slot Available - Isolated IO11 - Empty Slot Available - Isolated IO12 - Empty Slot Available - Isolated IO13 - Empty Slot Available - Isolated IO14 - Empty Slot Available - Isolated IO15 - Empty Slot Available - Isolated IO16 - Empty Slot Available - Isolated IO17 - Empty Slot Available - Isolatedbash-2.05$ showplatform PLATFORM: ========= Platform Type: Sun Fire E20K CSN: ==== Chassis Serial Number: 0609AK20BA COD: ==== Chassis HostID: 5014936D87943 Proc RTUs installed: 0 PROC Headroom Quantity: 0 Proc RTUs reserved for domain A: 0 Proc RTUs reserved for domain B: 0 Proc RTUs reserved for domain C: 0 Proc RTUs reserved for domain D: 0 Proc RTUs reserved for domain E: 0 Proc RTUs reserved for domain F: 0 Proc RTUs reserved for domain G: 0 Proc RTUs reserved for domain H: 0 Proc RTUs reserved for domain I: 0 Proc RTUs reserved for domain J: 0 Proc RTUs reserved for domain K: 0 Proc RTUs reserved for domain L: 0 Proc RTUs reserved for domain M: 0 Proc RTUs reserved for domain N: 0 Proc RTUs reserved for domain O: 0 Proc RTUs reserved for domain P: 0 Proc RTUs reserved for domain Q: 0 Proc RTUs reserved for domain R: 0 Available Component List for Domains: ===================================== Available Component List for domain A: No System boards No IO boards Available Component List for domain B: No System boards No IO boards Available Component List for domain C: No System boards No IO boards Available Component List for domain D: No System boards No IO boards Available Component List for domain E: No System boards No IO boards Available Component List for domain F: No System boards No IO boards Available Component List for domain G: No System boards No IO boards Available Component List for domain H: No System boards No IO boards Available Component List for domain I: No System boards No IO boards Available Component List for domain J: No System boards No IO boards Available Component List for domain K: No System boards No IO boards Available Component List for domain L: No System boards No IO boards Available Component List for domain M: No System boards No IO boards Available Component List for domain N: No System boards No IO boards Available Component List for domain O: No System boards No IO boards Available Component List for domain P: No System boards No IO boards Available Component List for domain Q: No System boards No IO boards Available Component List for domain R: No System boards No IO boards Domain Ethernet Addresses: ========================== Domain ID Domain Tag Ethernet Address A - 0:0:be:a9:fc:24 B - 0:0:be:a9:fc:25 C - 0:0:be:a9:fc:26 D - 0:0:be:a9:fc:27 E - 0:0:be:a9:fc:28 F - 0:0:be:a9:fc:29 G - 0:0:be:a9:fc:2a H - 0:0:be:a9:fc:2b I - 0:0:be:a9:fc:2c J - 0:0:be:a9:fc:2d K - 0:0:be:a9:fc:2e L - 0:0:be:a9:fc:2f M - 0:0:be:a9:fc:30 N - 0:0:be:a9:fc:31 O - 0:0:be:a9:fc:32 P - 0:0:be:a9:fc:33 Q - 0:0:be:a9:fc:34 R - 0:0:be:a9:fc:35 Domain configurations: ====================== Domain ID Domain Tag Solaris Nodename Domain Status A - PKEND021 Running Solaris B - PKENA019 Running Solaris C - PSBLD008 Running Solaris D - - Powered Off E - PEAID015 Running Solaris F - - Powered Off G - - Powered Off H - - Powered Off I - - Powered Off J - - Powered Off K - - Powered Off L - - Powered Off M - - Powered Off N - - Powered Off O - - Powered Off P - - Powered Off Q - - Powered Off R - - Powered Off
----------------------------------------------------------------------------------Check whicha all are the boads for domain C(Siebel) present.3)bash-2.05$ showboards -d CRetrieving board information. Please wait. ...... Location Pwr Type of Board Board Status Test Status Domain -------- --- ------------- ------------ ----------- ------ SB3 On V3CPU Active Passed C SB4 On V3CPU Active Passed C SB5 On V3CPU Active Passed C SB6 On V3CPU Active Passed C SB7 On V3CPU Active Passed C SB9 - Empty Slot Available - Isolated SB10 - Empty Slot Available - Isolated SB11 - Empty Slot Available - Isolated SB12 - Empty Slot Available - Isolated SB13 - Empty Slot Available - Isolated SB14 - Empty Slot Available - Isolated SB15 - Empty Slot Available - Isolated SB16 - Empty Slot Available - Isolated SB17 - Empty Slot Available - Isolated IO4 On HPCI+ Active Passed C IO5 On HPCI+ Active Passed C IO9 - Empty Slot Available - Isolated IO10 - Empty Slot Available - Isolated IO11 - Empty Slot Available - Isolated IO12 - Empty Slot Available - Isolated IO13 - Empty Slot Available - Isolated IO14 - Empty Slot Available - Isolated IO15 - Empty Slot Available - Isolated IO16 - Empty Slot Available - Isolated IO17 - Empty Slot Available - Isolated
we can see that Domain C is using Boards SB3,SB4,SB5,SB6,SB7 with IO4 and IO5.we want to reconfigure SB6 to Domain D.--------------------------------------------------------------------------Login to the domain C and check for the boards configured.Also check which is permanant board4)bash-2.05$ console -d Croot@PSBLD008 # cfgadm -alv |grep -i permanent SB5::memory connected configured ok base address 0x20000000000, 16777216 KBytes total, 14916912 KBytes permanent
root@PSBLD008 # cfgadm -l -s "select=class(sbd)"Ap_Id Type Receptacle Occupant Condition IO4 HPCI+ connected configured ok IO5 HPCI+ connected configured ok SB3 V3CPU connected configured ok SB4 V3CPU connected configured ok SB5 V3CPU connected configured ok SB6 V3CPU connected configured ok SB7 V3CPU connected configured ok
--------------------------------------------------------------------now we will unconfigure the 6th board SB6 and check with top command whether the memory has changed or not.5)root@PSBLD008 #cfgadm -v -c unconfigure SB6
request delete capacity (8 cpus) request delete capacity (4194304 pages) request delete capacity SB6 done request offline SUNW_cpu/cpu192 request offline SUNW_cpu/cpu196 request offline SUNW_cpu/cpu193 request offline SUNW_cpu/cpu197 request offline SUNW_cpu/cpu194 request offline SUNW_cpu/cpu198 request offline SUNW_cpu/cpu195 request offline SUNW_cpu/cpu199 request offline SUNW_cpu/cpu192 done request offline SUNW_cpu/cpu196 done request offline SUNW_cpu/cpu193 done request offline SUNW_cpu/cpu197 done request offline SUNW_cpu/cpu194 done request offline SUNW_cpu/cpu198 done request offline SUNW_cpu/cpu195 done request offline SUNW_cpu/cpu199 done unconfigure SB6 Apr 27 00:20:13 PSBLD008 dr: OS unconfigure dr@0:SB6::cpu0 Apr 27 00:20:24 PSBLD008 dr: OS unconfigure dr@0:SB6::cpu1 Apr 27 00:20:45 PSBLD008 dr: OS unconfigure dr@0:SB6::cpu2 Apr 27 00:20:57 PSBLD008 dr: OS unconfigure dr@0:SB6::cpu3 Apr 27 00:21:18 PSBLD008 dr: OS unconfigure dr@0:SB6::memory unconfigure SB6 done notify remove SUNW_cpu/cpu192 notify remove SUNW_cpu/cpu196 notify remove SUNW_cpu/cpu193 notify remove SUNW_cpu/cpu197 notify remove SUNW_cpu/cpu194 notify remove SUNW_cpu/cpu198 notify remove SUNW_cpu/cpu195 notify remove SUNW_cpu/cpu199 notify remove SUNW_cpu/cpu192 done notify remove SUNW_cpu/cpu196 done notify remove SUNW_cpu/cpu193 done notify remove SUNW_cpu/cpu197 done notify remove SUNW_cpu/cpu194 done notify remove SUNW_cpu/cpu198 done notify remove SUNW_cpu/cpu195 done notify remove SUNW_cpu/cpu199 done notify capacity change (8 cpus) notify capacity change (4194304 pages) notify capacity change SB6 done you have mail root@PSBLD008 #
root@PSBLD008 # top[H [Jlast pid: 20303; load averages: 6.22, 6.37, 5.07 [1;149H00:26:34 1023 processes:1015 sleeping, 3 zombie, 2 stopped, 3 on cpu CPU states: % idle, % user, % kernel, % iowait, % swap Memory: 64G real, 43G free, 16G swap in use, 72G swap free PID USERNAME LWP PRI NICE SIZE RES STATE TIME CPU COMMAND 17385 dbmonito 11 0 2 12G 8271M sleep 7:37 2.34% oracle 18962 oracle 1 0 0 12G 8261M cpu/224 4:50 2.30% oracle 17391 dbmonito 11 0 2 12G 8308M sleep 6:17 1.55% oracle 17389 dbmonito 11 0 2 12G 8305M sleep 6:15 1.54% oracle 17130 oracle 1 0 0 12G 8258M sleep 3:52 1.08% oracle 15334 oracle 1 60 0 12G 8258M sleep 6:29 0.98% oracle 17383 dbmonito 11 15 2 12G 8309M sleep 1:35 0.46% oracle 21602 oracle 1 46 0 12G 8258M cpu/161 304:05 0.23% oracle 2823 oracle 1 30 0 12G 8258M sleep 0:48 0.14% oracle 22925 oracle 1 40 0 12G 8256M sleep 331:53 0.13% oracle 21497 oracle 1 40 0 12G 8256M sleep 334:40 0.12% oracle 15185 oracle 11 60 0 12G 8262M sleep 0:49 0.10% oracle 20303 root 1 0 0 3728K 2688K cpu/128 0:00 0.09% top 3508 oracle 20 53 0 12G 8247M sleep 124.9H 0.09% oracle 20639 oracle 1 45 0 12G 8260M sleep 0:13 0.09% oracle
--------------------------------------------------------------------------Login to console and again check for board status6)
bash-2.05$ showboards -d C Retrieving board information. Please wait. ............. Location Pwr Type of Board Board Status Test Status Domain -------- --- ------------- ------------ ----------- ------ SB3 On V3CPU Active Passed C SB4 On V3CPU Active Passed C SB5 On V3CPU Active Passed C SB6 On V3CPU Active Passed C SB7 On V3CPU Active Passed C SB9 - Empty Slot Available - Isolated SB10 - Empty Slot Available - Isolated SB11 - Empty Slot Available - Isolated SB12 - Empty Slot Available - Isolated SB13 - Empty Slot Available - Isolated SB14 - Empty Slot Available - Isolated SB15 - Empty Slot Available - Isolated SB16 - Empty Slot Available - Isolated SB17 - Empty Slot Available - Isolated IO4 On HPCI+ Active Passed C IO5 On HPCI+ Active Passed C IO9 - Empty Slot Available - Isolated IO10 - Empty Slot Available - Isolated IO11 - Empty Slot Available - Isolated IO12 - Empty Slot Available - Isolated IO13 - Empty Slot Available - Isolated IO14 - Empty Slot Available - Isolated IO15 - Empty Slot Available - Isolated IO16 - Empty Slot Available - Isolated IO17 - Empty Slot Available - Isolated
again we will find the board as active and still configured so we need to disconnect and delete this board.
------------------------------------------------------------
Login again to domain C to disconnect the board
7)console -d C
Trying to connect...
Connected to Domain Server.
Your console is in exclusive mode now.
|-----------------------------------------------------------------|
| This system is for the use of authorized users only. |
| Individuals using this computer system without authority, or in |
| excess of their authority, are subject to having all of their |
| activities on this system monitored and recorded by system |
| personnel. |
| |
| In the course of monitoring individuals improperly using this |
| system, or in the course of system maintenance, the activities |
| of authorized users may also be monitored. |
| |
| Anyone using this system expressly consents to such monitoring |
| and is advised that if such monitoring reveals possible |
| evidence of criminal activity, system personnel may provide the |
| evidence of such monitoring to law enforcement officials. |
|-----------------------------------------------------------------|
PSBLD008 console login: root
Password:
Last login: Tue Apr 26 23:38:33 from 10.1.41.57
Apr 27 00:28:08 PSBLD008 login: ROOT LOGIN /dev/console
Sun Microsystems Inc. SunOS 5.9 Generic May 2002
WARNING:Only Tata Sky authorized users are permitted to use this System.
This System is under constant surveillance and monitoring. Any security
violation shall lead to disciplinary action as per the Tata Sky IT User
Policies and Guidelines.
SunOS PSBLD008 5.9 Generic_118558-21 sun4u sparc SUNW,Sun-Fire-15000
************************System Serial Number & Model********************
0609AK20BA
Memory size: 98304 Megabytes, (20 CPU, US-IV+ , dual core , total logical 40 )
***********************************************************************
You have new mail.
Sourcing //.profile-EIS.....
root@PSBLD008 #
root@PSBLD008 # bash
root@PSBLD008 # cfgadm -v -c disconnect SB6
disconnect SB6
disconnect SB6 done
poweroff SB6
poweroff SB6 done
unassign SB6 skipped
root@PSBLD008 #
------------------------------------------------------------
login to console and delete the board and if you give the showboard against C and D domain, you can see the difference.Instead of adding and deleting we can do this using command 'moveboard'(moveboard -d D configure SB6).But i am describing it in a step-by-step method
8)bash-2.05$ deleteboard SB6
SB6 successfully unassigned.
bash-2.05$ showboards -d C Retrieving board information. Please wait. ............ Location Pwr Type of Board Board Status Test Status Domain -------- --- ------------- ------------ ----------- ------ SB3 On V3CPU Active Passed C SB4 On V3CPU Active Passed C SB5 On V3CPU Active Passed C SB6 Off V3CPU Available Unknown Isolated SB7 On V3CPU Active Passed C SB9 - Empty Slot Available - Isolated SB10 - Empty Slot Available - Isolated SB11 - Empty Slot Available - Isolated SB12 - Empty Slot Available - Isolated SB13 - Empty Slot Available - Isolated SB14 - Empty Slot Available - Isolated SB15 - Empty Slot Available - Isolated SB16 - Empty Slot Available - Isolated SB17 - Empty Slot Available - Isolated IO4 On HPCI+ Active Passed C IO5 On HPCI+ Active Passed C IO9 - Empty Slot Available - Isolated IO10 - Empty Slot Available - Isolated IO11 - Empty Slot Available - Isolated IO12 - Empty Slot Available - Isolated IO13 - Empty Slot Available - Isolated IO14 - Empty Slot Available - Isolated IO15 - Empty Slot Available - Isolated IO16 - Empty Slot Available - Isolated IO17 - Empty Slot Available - Isolatedbash-2.05$ addboard -d D SB6Domain: D is not running. You can only "configure" a component into a running domain. Would you like to "assign" the component(s) to domain D instead (yes/no)? yes^?^? no bash-2.05$ showboards -d D Retrieving board information. Please wait. ...... Location Pwr Type of Board Board Status Test Status Domain -------- --- ------------- ------------ ----------- ------ SB6 Off V3CPU Available Unknown Isolated SB9 - Empty Slot Available - Isolated SB10 - Empty Slot Available - Isolated SB11 - Empty Slot Available - Isolated SB12 - Empty Slot Available - Isolated SB13 - Empty Slot Available - Isolated SB14 - Empty Slot Available - Isolated SB15 - Empty Slot Available - Isolated SB16 - Empty Slot Available - Isolated SB17 - Empty Slot Available - Isolated IO6 Off HPCI+ Assigned Unknown D IO7 Off HPCI+ Assigned Unknown D IO9 - Empty Slot Available - Isolated IO10 - Empty Slot Available - Isolated IO11 - Empty Slot Available - Isolated IO12 - Empty Slot Available - Isolated IO13 - Empty Slot Available - Isolated IO14 - Empty Slot Available - Isolated IO15 - Empty Slot Available - Isolated IO16 - Empty Slot Available - Isolated IO17 - Empty Slot Available - Isolated(Don't get confused because my Domain D was already powered off).-----------------------------------------------------------------------------------Check for the platform again.9)bash-2.05$ showplatformPLATFORM: ========= Platform Type: Sun Fire E20K CSN: ==== Chassis Serial Number: 0609AK20BA COD: ==== Chassis HostID: 5014936D87943 Proc RTUs installed: 0 PROC Headroom Quantity: 0 Proc RTUs reserved for domain A: 0 Proc RTUs reserved for domain B: 0 Proc RTUs reserved for domain C: 0 Proc RTUs reserved for domain D: 0 Proc RTUs reserved for domain E: 0 Proc RTUs reserved for domain F: 0 Proc RTUs reserved for domain G: 0 Proc RTUs reserved for domain H: 0 Proc RTUs reserved for domain I: 0 Proc RTUs reserved for domain J: 0 Proc RTUs reserved for domain K: 0 Proc RTUs reserved for domain L: 0 Proc RTUs reserved for domain M: 0 Proc RTUs reserved for domain N: 0 Proc RTUs reserved for domain O: 0 Proc RTUs reserved for domain P: 0 Proc RTUs reserved for domain Q: 0 Proc RTUs reserved for domain R: 0 Available Component List for Domains: ===================================== Available Component List for domain A: No System boards No IO boards Available Component List for domain B: No System boards No IO boards Available Component List for domain C: No System boards No IO boards Available Component List for domain D: No System boards No IO boards Available Component List for domain E: No System boards No IO boards Available Component List for domain F: No System boards No IO boards Available Component List for domain G: No System boards No IO boards Available Component List for domain H: No System boards No IO boards Available Component List for domain I: No System boards No IO boards Available Component List for domain J: No System boards No IO boards Available Component List for domain K: No System boards No IO boards Available Component List for domain L: No System boards No IO boards Available Component List for domain M: No System boards No IO boards Available Component List for domain N: No System boards No IO boards Available Component List for domain O: No System boards No IO boards Available Component List for domain P: No System boards No IO boards Available Component List for domain Q: No System boards No IO boards Available Component List for domain R: No System boards No IO boards Domain Ethernet Addresses: ========================== Domain ID Domain Tag Ethernet Address A - 0:0:be:a9:fc:24 B - 0:0:be:a9:fc:25 C - 0:0:be:a9:fc:26 D - 0:0:be:a9:fc:27 E - 0:0:be:a9:fc:28 F - 0:0:be:a9:fc:29 G - 0:0:be:a9:fc:2a H - 0:0:be:a9:fc:2b I - 0:0:be:a9:fc:2c J - 0:0:be:a9:fc:2d K - 0:0:be:a9:fc:2e L - 0:0:be:a9:fc:2f M - 0:0:be:a9:fc:30 N - 0:0:be:a9:fc:31 O - 0:0:be:a9:fc:32 P - 0:0:be:a9:fc:33 Q - 0:0:be:a9:fc:34 R - 0:0:be:a9:fc:35 Domain configurations: ====================== Domain ID Domain Tag Solaris Nodename Domain Status A - PKEND021 Running Solaris B - PKENA019 Running Solaris C - PSBLD008 Running Solaris D - - Powered Off E - PEAID015 Running Solaris F - - Powered Off G - - Powered Off H - - Powered Off I - - Powered Off J - - Powered Off K - - Powered Off L - - Powered Off M - - Powered Off N - - Powered Off O - - Powered Off P - - Powered Off Q - - Powered Off R - - Powered Off
----------------------------------------------------------------------Thus the board is removed from one domain.Now our activity is to add the board in new domain.and after that check for all the boards in D domain.10)bash-2.05$ addboard -d D SB6Domain: D is not running. You can only "configure" a component into a running domain. Would you like to "assign" the component(s) to domain D instead (yes/no)? yes SB6 assigned to domain: D
bash-2.05$showboards -d D Retrieving board information. Please wait. ......... Location Pwr Type of Board Board Status Test Status Domain -------- --- ------------- ------------ ----------- ------ SB6 Off V3CPU Assigned Unknown D SB9 - Empty Slot Available - Isolated SB10 - Empty Slot Available - Isolated SB11 - Empty Slot Available - Isolated SB12 - Empty Slot Available - Isolated SB13 - Empty Slot Available - Isolated SB14 - Empty Slot Available - Isolated SB15 - Empty Slot Available - Isolated SB16 - Empty Slot Available - Isolated SB17 - Empty Slot Available - Isolated IO6 Off HPCI+ Assigned Unknown D IO7 Off HPCI+ Assigned Unknown D IO9 - Empty Slot Available - Isolated IO10 - Empty Slot Available - Isolated IO11 - Empty Slot Available - Isolated IO12 - Empty Slot Available - Isolated IO13 - Empty Slot Available - Isolated IO14 - Empty Slot Available - Isolated IO15 - Empty Slot Available - Isolated IO16 - Empty Slot Available - Isolated IO17 - Empty Slot Available - Isolated
----------------------------------------------------------------------------------Now we have to turn on the keyswitch.Sometimes if the domain is down,it will automatically turn on the domain.11)bash-2.05$ showkeyswitch -d DVirtual key switch position: OFF
bash-2.05$ setkeyswitch -d D -l 32 on
Powering on: CSB at CS1 Already powered on: CSB at CS1 Powering on: CSB at CS0 Already powered on: CSB at CS0 Powering on: EXB at EX6 Powering on: HPCI+ at IO6 Powering on: V3CPU at SB6 Waiting on exclusive access to EXB(s): 00080. Powering on: EXB at EX7 Already powered on: EXB at EX7 Powering on: HPCI+ at IO7 Significant contents of .postrc (platform) /etc/opt/SUNWSMS/SMS1.6/config/platform/.postrc: # ident "@(#)postrc 1.1 01/04/02 SMI" Reading domain blacklist file /etc/opt/SUNWSMS/config/D/blacklist ... # ident "@(#)blacklist 1.1 01/04/02 SMI" Reading platform blacklist file /etc/opt/SUNWSMS/config/platform/blacklist ... # ident "@(#)blacklist 1.1 01/04/02 SMI" SEEPROM probe took 0 seconds. Reading Component Health Status (CHS) information ... stage lport_reset: Assert reset to IOC ports in -Q mode... stage_lport_reset(): Not -Q mode; Skipping Stage lport_reset stage bus_probe: Check in-use bus configurations... stage asic_probe: ASIC probe and JTAG/CBus integrity test... FAIL Slot IO7: Shares an expander with an unthrottled USIV+ board which is active in another domain. There is no FRU service action indicated for this failure. stage brd_rev_eval: Board Revision Evaluation and Compliance... stage cpu_probe: CPU Module probe... stage cdc_probe: CDC DIMM probe... stage mem_probe: Memory dimm probe... stage adapter_probe: I/O adapter probe... stage cp_shorts: Centerplane Shorts... stage lbist: Logic BIST... stage ibist: Interconnect BIST... stage field_ict: Field Interconnect Tests... stage mbist1: Internal memory BIST... stage mbist2: External memory BIST... stage domain_sync: Domain sync test... stage cbus_bbsram: Console Bus test of bootbus sram... stage sc_interrupt: DARB to SC interrupt... stage cdc_clear: CDC DIMM clear... stage cpu_lpost: Test all L1 CPU boards... Performing ASIC config with bus config a/d/r = 333... Slot0 in domain: 00040 Slot1 in domain: 00040 EXBs in use: 001BF sgcpu.flash file: Version 5.20.0 Build 15.1 I/F 12 is newest supported Fprom SB6/F0: Version 5.20.0 Build 15.1 I/F 12 is newest supported Fprom SB6/F1: Version 5.20.0 Build 15.1 I/F 12 is newest supported stage nmb_cpu_lpost: Non-Mem Board Proc tests... Performing ASIC config with bus config a/d/r = 333... Slot0 in domain: 00040 Slot1 in domain: 00040 EXBs in use: 001BF stage_cpu_lpost(): No NMB Boards in config. Skipping Stage nmb_cpu_lpost. Acquiring licenses for all good processors... stage wib_lpost: Wildcat interface board tests... stage_wib_lpost(): No good Wcis; Skipping Stage wib_lpost stage pci_lpost: Test all L1 I/O boards... Performing ASIC config with bus config a/d/r = 333... Slot0 in domain: 00040 Slot1 in domain: 00040 EXBs in use: 001BF pcilpost.elf Version 5.20.0 Build 15.1 I/F 12 is newest supported stage exp_lpost: Domain-level board and system tests... explpost.elf Version 5.20.0 Build 15.1 I/F 12 is newest supported stage cpu_lpost_II: CPU L1 domain/system tests... sgcpu.flash file: Version 5.20.0 Build 15.1 I/F 12 is newest supported Fprom SB6/F0: Version 5.20.0 Build 15.1 I/F 12 is newest supported Fprom SB6/F1: Version 5.20.0 Build 15.1 I/F 12 is newest supported stage pci_lpost_Q: Init all L1 I/O boards under -Q... stage cpu_lpost_II_Q: CPU L1 domain/system init under -Q... stage final_config: Final configuration... Creating CPU SRAM handoff structures... Creating GDCD IOSRAM handoff structures in Slot IO6... Writing domain information to PCD... Key to resource status value codes: ?=Unknown p=Present c=Crunched _=Undefined m=Missing i=Misconfig o=FailedOBP f=Failed b=Blacklisted r=Redlisted x=NotInDomain u=G,unconfig P=Passed ==G,lockstep l=NoLicense e=EmptyCasstt CPU_Brds: PortCore 3 2 1 0 Mem P/B: 3/1 3/0 2/1 2/0 1/1 1/0 0/1 0/0 Slot Gen 10101010 /L: 10 10 10 10 10 10 10 10 CDC SB06: P PPPPPPPP PP PP PP PP PP PP PP PP P I/O_Brds: IOC P1/Bus/Adapt IOC P0/Bus/Adapt Slot Gen Type P1 B1/10 B0/10 P0 B1/eb10 B0/10 (e=ENet, b=BBC) IO06: P hsPCI+ P p _p p _p P p PP_e p _p Configured in 333 with 4 procs, 32.000 GBytes, 3 IO adapters. Interconnect frequency is 149.987 MHz, Measured. Golden sram is on Slot IO6. POST (level=32, verbose=20) execution time 9:47
Thus the board is replaced and reconfigured to new domain in live.
Sunday, 10 April 2011
solutions for solaris,backup and storage: Hardware RAID configuration in T3-1 Oracle server
solutions for solaris,backup and storage: Hardware RAID configuration in T3-1 Oracle server: "Today I got a chance to configure Hardware RAID in T3-1 oracle server. T3-1 servers are recently launched Chip Multithread servers (CMT) fro..."
Hardware RAID configuration in T3-1 Oracle server
Today I got a chance to configure Hardware RAID in T3-1 oracle server.
T3-1 servers are recently launched Chip Multithread servers (CMT) from oracle which provides a good platform for virtualization applications especially for LDOMS(Unfortunate that i havent tried yet),Containers..
The method of configuring Hardware Raid is simple.The thing is that we will get confused after following steps suggested from Oracle.I have made it simpler.Its mentioned below.Seems Interesting.
Login to the server,you will find 2 harddisk in the server(By default).
1)root@PSBL223 # format
Searching for disks...done
AVAILABLE DISK SELECTIONS:
0. c0t5000CCA0151C7EB4d0 <SUN300G cyl 46873 alt 2 hd 20 sec 625>
/scsi_vhci/disk@g5000cca0151c7eb4
1. c0t5000CCA0151C7F80d0 <SUN300G cyl 46873 alt 2 hd 20 sec 625>
/scsi_vhci/disk@g5000cca0151c7f80
Specify disk (enter its number): ^C
T3-1 servers are recently launched Chip Multithread servers (CMT) from oracle which provides a good platform for virtualization applications especially for LDOMS(Unfortunate that i havent tried yet),Containers..
The method of configuring Hardware Raid is simple.The thing is that we will get confused after following steps suggested from Oracle.I have made it simpler.Its mentioned below.Seems Interesting.
Login to the server,you will find 2 harddisk in the server(By default).
1)root@PSBL223 # format
Searching for disks...done
AVAILABLE DISK SELECTIONS:
0. c0t5000CCA0151C7EB4d0 <SUN300G cyl 46873 alt 2 hd 20 sec 625>
/scsi_vhci/disk@g5000cca0151c7eb4
1. c0t5000CCA0151C7F80d0 <SUN300G cyl 46873 alt 2 hd 20 sec 625>
/scsi_vhci/disk@g5000cca0151c7f80
Specify disk (enter its number): ^C
-----------------------------------------------------------
Now poweroff server to go to SC prompt.What i did was, i gave a tip connection from another solaris server and worked from a remote location.
2)#init 5
-----------------------------------------------------------
Start the SP console.
Note: Before starting the console please check in which all slots are the HDD's are inserted in.When I got the server 2 HDD's were there, in which 1 HDD was in the 0th slot and other one in 3rd slot.What i did was i took out the HDD from 3rd slot and inserted it in 1st slot
3)-->start /SYS
Are you sure you want to start /SYS (y/n)? y
Starting /SYS
/SYS
Targets:
MB
MB_ENV
USBBD
DVD
PS0
PS1
SASBP
PDB
CONNBD
FANBD
INTSW
PWRBS
VPS
T_AMB
OK
LOCATE
SERVICE
PS_FAULT
TEMP_FAULT
FAN_FAULT
Properties:
type = Host System
ipmi_name = /SYS
keyswitch_state = Normal
product_name = SPARC T3-1
product_part_number = 4745525-4
product_serial_number = 1052BDR5A3
product_manufacturer = Oracle Corporation
fault_state = OK
clear_fault_action = (none)
power_state = On
Commands:
cd
reset
set
show
start
stop
----------------------------------------------------------------
Now you can go to the console from SP conssole
4)-> start /HOST/console
Are you sure you want to start /HOST/console (y/n)? y
Serial console started. To stop, type #.
----------------------------------------------------------------
Check for all the devices connected in server using below command and you will find the below output
5){0} ok show-devs
/pci-performance-counters@400
/niu@480
/pci@400
/cpu@7f
/cpu@7e
/cpu@7d
/cpu@7c
/cpu@7b
/cpu@7a
/cpu@79
/cpu@78
/cpu@77
/cpu@76
/cpu@75
/cpu@74
/cpu@73
/cpu@72
/cpu@71
/cpu@70
/cpu@6f
/cpu@6e
/cpu@6d
[7m More [<space>,<cr>,q,n,p,c] ? [m
/cpu@6c
/cpu@6b
/cpu@6a
/cpu@69
/cpu@68
/cpu@67
/cpu@66
/cpu@65
/cpu@64
/cpu@63
/cpu@62
/cpu@61
/cpu@60
/cpu@5f
/cpu@5e
/cpu@5d
/cpu@5c
/cpu@5b
/cpu@5a
/cpu@59
/cpu@58
/cpu@57
/cpu@56
[7m More [<space>,<cr>,q,n,p,c] ? [m
/cpu@55
/cpu@54
/cpu@53
/cpu@52
/cpu@51
/cpu@50
/cpu@4f
/cpu@4e
/cpu@4d
/cpu@4c
/cpu@4b
/cpu@4a
/cpu@49
/cpu@48
/cpu@47
/cpu@46
/cpu@45
/cpu@44
/cpu@43
/cpu@42
/cpu@41
/cpu@40
/cpu@3f
[7m More [<space>,<cr>,q,n,p,c] ? [m
/cpu@3e
/cpu@3d
/cpu@3c
/cpu@3b
/cpu@3a
/cpu@39
/cpu@38
/cpu@37
/cpu@36
/cpu@35
/cpu@34
/cpu@33
/cpu@32
/cpu@31
/cpu@30
/cpu@2f
/cpu@2e
/cpu@2d
/cpu@2c
/cpu@2b
/cpu@2a
/cpu@29
/cpu@28
[7m More [<space>,<cr>,q,n,p,c] ? [m
/cpu@27
/cpu@26
/cpu@25
/cpu@24
/cpu@23
/cpu@22
/cpu@21
/cpu@20
/cpu@1f
/cpu@1e
/cpu@1d
/cpu@1c
/cpu@1b
/cpu@1a
/cpu@19
/cpu@18
/cpu@17
/cpu@16
/cpu@15
/cpu@14
/cpu@13
/cpu@12
/cpu@11
[7m More [<space>,<cr>,q,n,p,c] ? [m
/cpu@10
/cpu@f
/cpu@e
/cpu@d
/cpu@c
/cpu@b
/cpu@a
/cpu@9
/cpu@8
/cpu@7
/cpu@6
/cpu@5
/cpu@4
/cpu@3
/cpu@2
/cpu@1
/cpu@0
/virtual-devices@100
/iscsi-hba
/virtual-memory
/memory@m0,20000000
/aliases
/options
[7m More [<space>,<cr>,q,n,p,c] ? [m
/openprom
/chosen
/packages
/pci@400/pci@2
/pci@400/pci@1
/pci@400/pci@2/pci@0
/pci@400/pci@2/pci@0/pci@f
/pci@400/pci@2/pci@0/pci@c
/pci@400/pci@2/pci@0/pci@a
/pci@400/pci@2/pci@0/pci@8
/pci@400/pci@2/pci@0/pci@7
/pci@400/pci@2/pci@0/pci@6
/pci@400/pci@2/pci@0/pci@4
/pci@400/pci@2/pci@0/pci@0
/pci@400/pci@2/pci@0/pci@f/pci@0
/pci@400/pci@2/pci@0/pci@f/pci@0/usb@0,2
[7m More [<space>,<cr>,q,n,p,c] ? [m
/pci@400/pci@2/pci@0/pci@f/pci@0/usb@0,1
/pci@400/pci@2/pci@0/pci@f/pci@0/usb@0
/pci@400/pci@2/pci@0/pci@f/pci@0/usb@0,2/hub@4
/pci@400/pci@2/pci@0/pci@f/pci@0/usb@0,2/hub@2
/pci@400/pci@2/pci@0/pci@f/pci@0/usb@0,2/hub@2/device@4
/pci@400/pci@2/pci@0/pci@f/pci@0/usb@0,2/hub@2/hub@3
/pci@400/pci@2/pci@0/pci@f/pci@0/usb@0,2/hub@2/device@4/mouse@1
/pci@400/pci@2/pci@0/pci@f/pci@0/usb@0,2/hub@2/device@4/keyboard@0
/pci@400/pci@2/pci@0/pci@f/pci@0/usb@0,2/hub@2/hub@3/storage@2
/pci@400/pci@2/pci@0/pci@f/pci@0/usb@0,2/hub@2/hub@3/storage@2/disk
/pci@400/pci@2/pci@0/pci@7/network@0,1
/pci@400/pci@2/pci@0/pci@7/network@0
/pci@400/pci@2/pci@0/pci@6/network@0,1
/pci@400/pci@2/pci@0/pci@6/network@0
/pci@400/pci@2/pci@0/pci@4/scsi@0
/pci@400/pci@2/pci@0/pci@4/scsi@0/disk
/pci@400/pci@2/pci@0/pci@4/scsi@0/tape
/pci@400/pci@2/pci@0/pci@0/pci@0
/pci@400/pci@2/pci@0/pci@0/pci@0/display@0
/pci@400/pci@1/pci@0
/pci@400/pci@1/pci@0/pci@8
/pci@400/pci@1/pci@0/pci@6
/pci@400/pci@1/pci@0/pci@4
[7m More [<space>,<cr>,q,n,p,c] ? [m
/pci@400/pci@1/pci@0/pci@0
/pci@400/pci@1/pci@0/pci@4/scsi@0
/pci@400/pci@1/pci@0/pci@4/scsi@0/disk
/pci@400/pci@1/pci@0/pci@4/scsi@0/tape
/virtual-devices@100/channel-devices@200
/virtual-devices@100/console@1
/virtual-devices@100/random-number-generator@e
/virtual-devices@100/ncp@6
/virtual-devices@100/n2cp@7
/virtual-devices@100/tpm@f
/virtual-devices@100/flashprom@0
/virtual-devices@100/channel-devices@200/virtual-domain-service@0
/virtual-devices@100/channel-devices@200/virtual-channel-client@2
/virtual-devices@100/channel-devices@200/virtual-channel-client@1
/virtual-devices@100/channel-devices@200/virtual-channel@3
/virtual-devices@100/channel-devices@200/virtual-channel@0
/iscsi-hba/disk
/openprom/client-services
/packages/obp-tftp
/packages/kbd-translator
/packages/SUNW,asr
/packages/dropins
/packages/terminal-emulator
[7m More [<space>,<cr>,q,n,p,c] ? [m
/packages/disk-label
/packages/deblocker
/packages/SUNW,builtin-drivers
{0} ok
{0} ok
----------------------------------------------------------------------------
Run probe-scsi-all command and check its showing all targets present in the server
Here you can find 2 targets Target 9 and Target a,though only Target 9 is working..we need to select new SCSI device.
6){0} ok probe-scsi-all
/pci@400/pci@2/pci@0/pci@f/pci@0/usb@0,2/hub@2/hub@3/storage@2
Unit 0 Removable Read Only device AMI Virtual CDROM 1.00
/pci@400/pci@2/pci@0/pci@4/scsi@0
FCode Version 1.00.54 , MPT Version 2.00, Firmware Version 5.00.17.00
Target 9
Unit 0 Disk HITACHI H103030SCSUN300G A2A8 585937500 Blocks, 300 GB
SASDeviceName 5000cca0151c7f80 SASAddress 5000cca0151c7f81 PhyNum 0
Target a
Unit 0 Removable Read Only device TEAC DV-W28SS-R 1.0C
SATA device PhyNum 6
/pci@400/pci@1/pci@0/pci@4/scsi@0
FCode Version 1.00.54 , MPT Version 2.00, Firmware Version 5.00.17.00
Target 9
Unit 0 Disk HITACHI H103030SCSUN300G A2A8 585937500 Blocks, 300 GB
SASDeviceName 5000cca0151c7eb4 SASAddress 5000cca0151c7eb5 PhyNum 0
-------------------------------------------------------------------------------------------
Now if you give you show-children we can clearly see only one target.
7){0} ok select /pci@400/pci@1/pci@0/pci@4/scsi@0 (or select scsi0)
{0} ok show-children
FCode Version 1.00.54 , MPT Version 2.00, Firmware Version 5.00.17.00
Target 9
Unit 0 Disk HITACHI H103030SCSUN300G A2A8 585937500 Blocks, 300 GB
SASDeviceName 5000cca0151c7eb4 SASAddress 5000cca0151c7eb5 PhyNum 0
-----------------------------------------------------------------
Now select the other scsi1 in which we have inserted the 2nd Hdd. The 2 devices needed to be selected i have marked in red colour in show-devs OBP command
8){0} ok select /pci@400/pci@2/pci@0/pci@4/scsi@0
{0} ok show-children
FCode Version 1.00.54, MPT Version 2.00, Firmware Version 5.00.17.00
Target 9
Unit 0 Disk HITACHI H103030SCSUN300G A2A8 585937500 Blocks, 300 GB
SASDeviceName 5000cca0151c7eb4 SASAddress 5000cca0151c7eb5 PhyNum 0
Target a
Unit 0 Disk HITACHI H103030SCSUN300G A2A8 585937500 Blocks, 300 GB
SASDeviceName 5000cca0151c7f80 SASAddress 5000cca0151c7f81 PhyNum 1
-----------------------------------------------------------------
Now you can create a RAID using following command with volume size and a volume name.
9){0} ok 9 a create-raid1-volume
Target 9 size is 583983104 Blocks, 298 GB
Target a size is 583983104 Blocks, 298 GB
The volume can be any size from 1 MB to 285148 MB
What size do you want? [285148]
Volume size will be 583983104 Blocks, 298 GB
Enter a volume name: [0 to 15 characters] v0
Volume has been created
-----------------------------------------------------------------------------
Check for if its showing volumes properly
10){0} ok show-volumes
Volume 0 Target 389 Type RAID1 (Mirroring)
Name v0 WWID 04b8a3893433dc55
Optimal Enabled Background Init In Progress
2 Members 583983104 Blocks, 298 GB
Disk 0
Primary Optimal
Target 9 HITACHI H103030SCSUN300G A2A8
Disk 1
Secondary Optimal
Target a HITACHI H103030SCSUN300G A2A8
-------------------------------------------------------------
After starting the server please verify the format output that it will show only one disk as Raid formation of 2 disk.
11)# formatSearching for disks...done c0t34B8A3893433DC55d0: configured with capacity of 277.99GB AVAILABLE DISK SELECTIONS: 0. c0t34B8A3893433DC55d0 <LSI-LogicalVolume-3000 cyl 65533 alt 2 hd 64 sec 139> /pci@400/pci@1/pci@0/pci@4/scsi@0/iport@v0/disk@w34b8a3893433dc55,0 Specify disk (enter its number): 0 selecting c0t34B8A3893433DC55d0 [disk formatted] Disk not labeled. Label it now? n
Thus Hardware RAID is configured..In case of any doubts please ask your questions in my blog..
Saturday, 9 April 2011
X-Manager Configuration in Solaris 10
I was asked to install oracle client. but then i found it difficult as i was not able to login to server through X-Manager.The server was Oracle Solaris 10 and it was in a T3-1 server.A server recently launched with New feature of Hardware RAID.Configuration of hardware raid in T3-1 will be posted in my next article.Here is how i configured X-manager for the server through service configure
Earlier in Solaris 9(before release of solaris 10 )every services was in the form of file systems.Like for starting SSH, u need to go to /etc/init.d and start a particular service linked with rc.xx scripts.But after the release of Solaris 10 A concept like that of registry in windows was put into this unix flavour...andn this structure we can configure.Seems interestin, right???.Another thing is even we can make a new service
Before that let me put a word on X-manager which is a tool through which we can login to any Unix server in GUI mode.There are some other tools like humming Bird..etc...But X-Manager is preferred due to its flexibility and performance.
Through command svccfg we can configure any service
1)# svccfg
svc:>
from here if you press tab button 2 times you can see which all are the services.and for X-Manager select the service 'cde-login'
2)select cde-login
svc:/application/graphical-login/cde-login>
here again if you give tab button 2 times you can see various options.I have listed property 'arg'
3)svc:/application/graphical-login/cde-login>
Earlier in Solaris 9(before release of solaris 10 )every services was in the form of file systems.Like for starting SSH, u need to go to /etc/init.d and start a particular service linked with rc.xx scripts.But after the release of Solaris 10 A concept like that of registry in windows was put into this unix flavour...andn this structure we can configure.Seems interestin, right???.Another thing is even we can make a new service
Before that let me put a word on X-manager which is a tool through which we can login to any Unix server in GUI mode.There are some other tools like humming Bird..etc...But X-Manager is preferred due to its flexibility and performance.
Through command svccfg we can configure any service
1)# svccfg
svc:>
from here if you press tab button 2 times you can see which all are the services.and for X-Manager select the service 'cde-login'
-----------------------------------------------------------
2)select cde-login
svc:/application/graphical-login/cde-login>
here again if you give tab button 2 times you can see various options.I have listed property 'arg'
3)svc:/application/graphical-login/cde-login>
add delhash exit inventory repository validate
addpg delpg export list select
apply delprop extract listpg set
archive editprop help listprop setprop
delete end import quit unselect
svc:/application/graphical-login/cde-login> listprop arg
-----------------------------------------------------------
now i set the property dtlogin/args with udp port 177 and quit it
4)svc:/application/graphical-login/cde-login> setprop dtlogin/args=astring:" -udpPort 177"
svc:/application/graphical-login/cde-login> quit
-----------------------------------------------------------
after that i restarted the service cde-login.
5)# svcadm restart cde-login
check for the service if its running or not..you are supposed to get a service running on udpPort 177
and if its running then its done.Else you do one thing.(sshhh.... take a hammer and make it into a round shape..we can play football)
-------------------------------------------------------------
6)# pgrep -lf dtlogin
18325 /usr/dt/bin/dtlogin -daemon -udpPort 177
Now you can login to your server through X-manager
Subscribe to:
Posts (Atom)