Migrating to Community-driven Infrastructure

(Plan for migration / Timeline)
(Introduction: migration#2)
 
(65 intermediate revisions not shown)
Line 1: Line 1:
-
== Introduction ==
+
'''please somebody does a major rework of this page, to a new wiki page like "community maemo.org infra - at IPHH" and clean out all the history data that doesn't give any useful info anymore. Also splitting off the autobuilder stuff into a separate page might help'''
-
[up to date as of 2013-02-08]
+
'''''--[[User:joerg_rw|joerg_rw]] 15:20, 4 June 2013 (UTC)'''''
-
Albeit Nokia's plans about discontinuation of maemo support been known since spring 2012, Nokia gave "Go" to Nemein (service provider on behalf of Nokia) for the real migration work not earlier than 2 weeks before christmas 2012.
+
-
As of January, 18th 2013 the *.maemo.org infrastructure has been consolidated from a 20+ physical servers (aka "irons") to current config and completely migrated to new locations independant of Nokia servers. This task been accomplished by Nemein. Talk.maemo.org forum been integrated with the other infra, many thanks to Nemein for donating the VM for that. Also many thanks to Nemein for this incredible piece of work done during times when others (as well as the guys there) usually are already away for winter holidays.
+
Started to move stuff
-
The current setup (see below) consists of around 10 Virtual Machines hosted by Nemein on their xen-grid. This is an interim solution. Nokia paid Nemein for this consolidation/migration and hosting until end of February.
+
http://wiki.maemo.org/Maemo.org_Infrastructure
-
Handing over control of servers still pending, right now (2013-01-30) it's still Nemein and affiliates to control that infra.  
+
http://wiki.maemo.org/Maemo.org_Infrastructure/Autobuilder
-
Transfer of control over (*.)maemo.org DNS entries ("the domain") is still being negotiated between Nokia and HiFo, all DNS changes done so far been done by Nokia dnsmaster on Nemein's request
+
Please use Categories below for any future pages of this topic
-
 
+
-
The plans of council and HiFo board so far are: kindly ask Nemein to have  *.maemo.org nicely bundled. We hope for this setup to be free of major known bugs (I.E. autobuilder working, repository working albeit maybe slow) when Nemein hands us the package.
+
-
 
+
-
[2013-02-08] Negotiations about direct migration to one of our 3 options (see below) are ongoing.
+
-
 
+
-
===further plans, state of migration===
+
-
Further plans are to migrate again to some hosted root servers, either on a sponsor like http://osuosl.org/about-osuosl or to our own stuff we may rent from e.g. Hetzner.
+
-
 
+
-
[2013-02-08] currently we're in negotiations about 3 possible ways into future hosting:
+
-
* osuosl (could provide VM or rootservers or CoLo [UPS server shipping: 48h:1200EUR, 7d:630EUR, +customs])
+
-
* IPHH, a ISP in Hamburg. Falk contacted them and they are willing to offer CoLo basically free of charge. Of course we will put their name on our maemo.org frontpage to give due credit. HW service will be done by Falk. (costs ~300EUR for setup and HW upgrade, plus 50..300EUR for shipping the iron to Hamburg)
+
-
* get own paid rootservers, like 2 of http://www.hetzner.de/en/hosting/produkte_rootserver/ex10 (costs ~300EUR/month, 400EUR setup)
+
-
Depending on option chosen, we might or might not keep the SuperMicro.
+
-
 
+
-
[2013-02-17] Hildon Foundation board has agreed on following IPHH and keeping OSUOSL open as an alternative if something with our "plan-A" goes awry. Tech staff wholheartedly agrees and will act occordingly, sending iron to IPHH/Falk on 2013-02-18/19, and also negotiating with OSUOSL about how a possible migration to them would look like so we get a decent checklist in case we need it.
+
-
 
+
-
Falk's mail forwarded form IPHH to HiFo:
+
<pre>
<pre>
-
Hi everyone,
+
[[Category:maemo.org]]
 +
[[Category:Infrastructure]]
 +
</pre>
-
these are the details, what IPHH is willing to offer us.
+
'''''--[[User:sixwheeledbeast|sixwheeledbeast]] 20:46, 5 June 2013 (UTC)'''''
-
Best regards,
+
Suggest maintainers should be removed an [[Maemo.org_team]] page should be updated
-
Falk
+
'''''--[[User:sixwheeledbeast|sixwheeledbeast]] 20:50, 5 June 2013 (UTC)'''''
-
Begin forwarded message:
+
== Introduction ==
 +
[2013-11-07] migration to our own servers at IPHH accomplished since ~6 months now, below info is for historical documentation.
-
> From: Rene Sasse <support@iphh.xxx>
+
[up to date as of 2013-02-08]
-
> Subject: [IPHH #442659] Re: maemo.org
+
-
> Date: 18. Februar 2013 11:08:33 MEZ
+
-
> To: falk@fourecks.xxx
+
-
> Cc: joerg@openmoko.xxx
+
-
> Reply-To: support@iphh.xxx
+
-
>
+
-
> Falk,
+
-
>
+
-
> IPHH offers the following services to Hildon Foundation for one year free of  
+
-
> charge:
+
-
>
+
-
> * Colocation/electricity for the following devices:
+
-
> - 1 Server (2RU)
+
-
> - 1 Switch (1RU)
+
-
> * 1 100MBit/s Uplink Port
+
-
> * A /27 IPv4 Network
+
-
>
+
-
> This offer is valid for one year and has to be discussed for renewal after 11
+
-
> month.
+
-
>
+
-
> Legal Contact will be:
+
-
>
+
-
> Hildon Foundation
+
-
> 120 West 10th Street, Erie, PA, 16501, USA
+
-
>
+
-
> Technical Contact will be:
+
-
>
+
-
> Falk Stern (FS7182-RIPE)
+
-
> Rathmann-Cohrs-Straße 12, 21357 Bardowick, Germany
+
-
> Mobile: +49-160-71560xx
+
-
>
+
-
>
+
-
> best regards
+
-
> Rene
+
-
>
+
-
> --
+
-
> Rene Sasse                          E-Mail: support@iphh.xxx
+
-
> Technical Consultant                Tel: +49 (0)40 374919-xx
+
-
> IPHH Internet Port Hamburg GmbH      Fax: +49 (0)40 374919-xx
+
-
> Wendenstrasse 408                    AG Hamburg, HRB 76071
+
-
> D-20537 Hamburg                      Geschaeftsfuehrung: Axel G. Kroeger--7E94C7404EC25FD69CC85C3653348297
+
-
>
+
-
</pre>
+
-
Iron to move form:  ( http://nemein.com/fi/ )
+
Albeit Nokia's plans about discontinuation of maemo support been known since spring 2012, Nokia gave "Go" to Nemein (service provider on behalf of Nokia) for the real migration work not earlier than 2 weeks before christmas 2012.
-
Nemein Oy
+
-
tel. +358 20-198 6030
+
-
Vilhonvuorenkatu 11 D, 8 krs
+
-
00500 Helsinki, FINLAND
+
-
FIN-1647219-2
+
-
    support AT nemein.com
+
-
to ( http://www.iphh.net/en/contact.html )
+
As of January, 18th 2013 the *.maemo.org infrastructure has been consolidated from a 20+ physical servers (aka "irons") to current config and completely migrated to new locations independant of Nokia servers. This task been accomplished by Nemein. Talk.maemo.org forum been integrated with the other infra, many thanks to Nemein for donating the VM for that. Also many thanks to Nemein for this incredible piece of work done during times when others (as well as the guys there) usually are already away for winter holidays.
-
  IPHH Internet Port Hamburg GmbH
+
-
  #444615
+
-
  Wendenstrasse 408
+
-
  20537 Hamburg
+
-
  Germany
+
-
T : +49 40 37 49 19-0
+
-
F : +49 40 37 49 19-29
+
-
E : info@iphh.net
+
-
Package details:
+
The current setup (see below) consists of around 10 Virtual Machines hosted by Nemein on their xen-grid. This is an interim solution. Nokia paid Nemein for this consolidation/migration and hosting until end of February.
-
size
+
-
  x: 100cm
+
-
  y: 66cm
+
-
  z: 28cm
+
-
weight: ~40kg
+
 +
Handing over control of servers still pending, right now (2013-01-30) it's still Nemein and affiliates to control that infra.
-
Shipping accomplished:
+
Transfer of control over (*.)maemo.org DNS entries ("the domain") is still being negotiated between Nokia and HiFo, all DNS changes done so far been done by Nokia dnsmaster on Nemein's request
-
Shipment number 1139212793
+
-
Status from Wed, 20.02.2013 10:57 hours Delivered - signed for by Herr POLROK*
+
-
Recipient TPHH
+
-
Delivered on Herr POLROK*
+
-
via DHL account provided by Nokia/Pekka (many thanks!) on 2013-02-19.
+
-
Courtesy Aslan and Eero of Nemein.
+
-
+
The plans of council and HiFo board so far are: kindly ask Nemein to have  *.maemo.org nicely bundled. We hope for this setup to be free of major known bugs (I.E. autobuilder working, repository working albeit maybe slow) when Nemein hands us the package.
-
Hosting migration timing plan:
+
-
https://docs.google.com/spreadsheet/ccc?key=0AuQnrvWRbTtzdFhERzlDbEp0eVNQQTdfNWpQbzdIT0E&usp=sharing
+
-
 
+
-
 
+
-
Alternatives - however obvious - for the above plans have been discussed with Nemein and HiFo and are not feasible. E.G. there was no way we could get the money instead of the server iron hardware. Sustaining the current xen-grid based VM hosting would be ~1500EUR per month plus a basically not evadable 2200EUR on top for maintenace. We want to switch away from that by all means, thus the 2nd migration.
+
 +
[2013-02-08] Negotiations about direct migration to one of our 3 options (see below) are ongoing.
 +
===further plans, state of migration===
 +
(obsolete. thus deleted. See wiki history if interested in what happened when)
   
   
This page is intended as a central place where status and other operational information can be gathered.
This page is intended as a central place where status and other operational information can be gathered.
-
=== Plan for migration / Timeline ===
+
=== Plan for migration / Timeline [2013-03-15]===
* Friday, 22.2. (falk)
* Friday, 22.2. (falk)
Line 140: Line 56:
* ... (hidden DNS master set up)
* ... (hidden DNS master set up)
** sync databases, switch DNS entries
** sync databases, switch DNS entries
 +
** DNS switched [Nokia] to new IPs on 2013-03-14 1700UTC. Final sync established 1900. since then machines up and running on *new*
VMs we need to migrate:  
VMs we need to migrate:  
{|
{|
-
! Name !! Disk Size !! Location !! _migrated? !! _Comments
+
! Name !! Disk Size !! Location of act. instance !! _migrated? !! _Comments on *new* instance
|+
|+
-
| static || 30G || nemein || copied || (copied!)
+
| static || 30G || nemein || synced+up || works
|+
|+
-
| wiki || 20G || nemein || copied || also bugs. (copied!)
+
| wiki || 20G || nemein || synced+up || works
|+
|+
-
| repository || 900G || nemein || no || We need to check the disk size, this might be too big for current hw, maybe split tablets-dev off. (copying)
+
| repository || 900G || nemein || synced+up || We need to check the disk size, this might be too big for current hw, maybe split tablets-dev off.
|+
|+
-
| mail || 20G || nemein || no || also has lists
+
| mail || 20G || nemein || synced+up || also has lists
|+
|+
-
| scratchbox || 20G || nemein || no || has an NFS mount from db!
+
| scratchbox || 100G || iphh || setup! || will be setup new
|+
|+
-
| vcs || 50G || nemein || copying || has NFS mounts from garage and repository (copying)
+
| vcs || 50G || nemein || synced+up || has NFS mounts from garage and repository (copying)
|+
|+
-
| garage || 100G || nemein || copied || has NFS mounts from stage and vcs (copied!)
+
| garage || 100G || nemein || synced+up || has NFS mounts from stage and vcs (copied, seems to work)
|+
|+
-
| db || 100G || nemein || copied || 73G used, maybe disk needs to be bigger (copied!)
+
| db || 100G || nemein || synced+up || works, needs tuning
|+
|+
-
| builder || 50G || nemein || no ||  
+
| builder || 50G || nemein || copied+up || still needs fixing several aspects
|+
|+
-
| talk || 20G || nemein || no ||  
+
| talk || 20G || nemein || synced+up || up since 2013-03-13, via HTTP-forward
-
|}
+
-
 
+
-
=== Setup with IPHH ===
+
-
 
+
-
==== Networks ====
+
-
 
+
-
We have 2 /28 Subnets (213.128.137.0/28 and 213.128.137.16/28)
+
-
 
+
-
Networks are configured as follows:
+
-
 
+
-
{|
+
-
! IPv4 !! IPv6 !! VLAN !! Xen Bridge !! default GW
+
|+
|+
-
| 213.128.137.0/28 || not yet || 1 || xenbr0 || 213.128.137.14
+
| dns || ?? || ipph || setup! || dns records/serial incomplete, bind inactive
-
|+
+
-
| 213.128.137.16/28 || not yet || 2 || xenbr1 || 213.128.137.17
+
-
|+
+
-
| 10.0.1.0/24 || not yet || 3 || xenbr2 || 10.0.1.1
+
|}
|}
-
IP Plan for vlan 1
+
=== State of final migration ===
-
 
+
all VMs got migrated to IPHH server, DNS still owned and managed by Nokia [2013-05-29]
-
{|
+
-
! IPv4 !! IPv6 !! Hostname
+
-
|+
+
-
| 213.128.137.1 || n/a || firewall-carp
+
-
|+
+
-
| 213.128.137.2 || n/a || firewall-a
+
-
|+
+
-
| 213.128.137.3 || n/a || firewall-b
+
-
|+
+
-
| 213.128.137.4 || n/a || blade-a
+
-
|+
+
-
| 213.128.137.5 || n/a || blade-b
+
-
|+
+
-
| 213.128.137.6 || n/a || portforwarding for monitor
+
-
|+
+
-
| 213.128.137.7 || n/a ||
+
-
|+
+
-
| 213.128.137.8 || n/a ||
+
-
|+
+
-
| 213.128.137.9 || n/a ||
+
-
|+
+
-
| 213.128.137.10 || n/a ||
+
-
|+
+
-
| 213.128.137.11 || n/a ||
+
-
|+
+
-
| 213.128.137.12 || n/a || IPHH Router 1
+
-
|+
+
-
| 213.128.137.13 || n/a || IPHH Router 2
+
-
|+
+
-
| 213.128.137.14 || n/a || IPHH-VRRP
+
-
|}
+
-
 
+
-
IP Plan for vlan 2
+
-
 
+
-
{|
+
-
! IPv4 !! IPv6 !! Hostname
+
-
|+
+
-
| 213.128.137.17 || n/a || firewall-carp
+
-
|+
+
-
| 213.128.137.18 || n/a || firewall-a
+
-
|+
+
-
| 213.128.137.19 || n/a || firewall-b
+
-
|+
+
-
| 213.128.137.20 || n/a || static/www
+
-
|+
+
-
| 213.128.137.21 || n/a || wiki
+
-
|+
+
-
| 213.128.137.22 || n/a || repository
+
-
|+
+
-
| 213.128.137.23 || n/a || mail
+
-
|+
+
-
| 213.128.137.24 || n/a || scratchbox
+
-
|+
+
-
| 213.128.137.25 || n/a || vcs
+
-
|+
+
-
| 213.128.137.26 || n/a || garage
+
-
|+
+
-
| 213.128.137.27 || n/a || builder
+
-
|+
+
-
| 213.128.137.28 || n/a || talk
+
-
|+
+
-
| 213.128.137.29 || n/a || DNS
+
-
|+
+
-
| 213.128.137.30 || n/a ||
+
-
|}
+
-
 
+
-
IP Plan for vlan 3
+
-
 
+
-
{|
+
-
! IPv4 !! IPv6 !! Hostname
+
-
|+
+
-
| 10.0.1.1 || n/a || firewall-carp
+
-
|+
+
-
| 10.0.1.2 || n/a || firewall-a
+
-
|+
+
-
| 10.0.1.3 || n/a || firewall-b
+
-
|+
+
-
| 10.0.1.10 || n/a || db
+
-
|+
+
-
| 10.0.1.11 || n/a || monitor
+
-
|+
+
-
| 10.0.1.200 || n/a || blade-a/IPMI
+
-
|+
+
-
| 10.0.1.201 || n/a || blade-b/IPMI
+
-
|+
+
-
| 10.0.1.202 || n/a || maemo-switch
+
-
|}
+
-
 
+
-
==== Disk Layout of blade-[ab] ====
+
-
 
+
-
Both disks have the following partitioning:
+
-
RAID1 Volume for /boot (/dev/md0), consisting of /dev/sda1 and /dev/sdb1 (200M)
 
-
RAID1 Volume /dev/md1 consisting of /dev/sda2 and /dev/sdb2 (around 970G)
 
-
The RAID1 Volume contains a physical LVM volume.
 
-
We only have one VolumeGroup (vg_blade[ab]), which has LogVol00 with 20G as root volume, LogVol01 with 2 Gig as swap and vmstore with the rest as VM Storage mounted on /vmstore.
 
==== Tips & Tricks for migration ====
==== Tips & Tricks for migration ====
Line 301: Line 107:
<pre>cd / ; rsync -arvSxz . root@host:/mount/point</pre>
<pre>cd / ; rsync -arvSxz . root@host:/mount/point</pre>
-
== Inventory (obsolete, please update) ==
+
==== Stuff to do [2013-03-15] ====  
-
As a first step we try to gather information about the present infrastructure at *.maemo.org. This "inventory" is intended to provide an overview about all components of the infrastructure as well as to provide information that will later on aid during the actual migration.
+
* Implement a proper service monitoring for all machines and applications - nagios pending, http://monitor.maemo.org/ganglia/
 +
* Setup a common policy for root/user accounts and sudo permissions
 +
* Change root-passwords - done
 +
* Make SSH root-login key-only - done?
 +
* Find out, what to sync for final migration - done
 +
* Configure internal DNS server in /etc/resolv.conf
 +
* Coordinate DNS setup with Nokia - partially done
 +
* Consolidate Databases - WIP
 +
* Add disks to system - done, 4TB on blade-a
 +
* Setup bugtracking system for infrastructure - done: roundup?
 +
* fix NFS mounts - WIP
 +
* update VMs to 3.2.0-38
-
Currently the following topics are considered important for the migration:
+
==== Problems we walked into ====
 +
 
 +
===== Machines throwing their network away =====
 +
 
 +
Apparently, XEN has issues if a vm sends too many/too large network packets.
 +
 
 +
http://lists.xen.org/archives/html/xen-devel/2013-01/msg00198.html
 +
has an interesting read about that problem.
 +
 
 +
Symptom:
 +
 
 +
  xenbr1: port 8(vif51.0) entered forwarding state
 +
  vif vif-51-0 vif51.0: Too many frags
 +
  vif vif-51-0 vif51.0: fatal error; disabling device
 +
  xenbr1: port 8(vif51.0) entered disabled state
 +
 
 +
in dmesg
 +
 
 +
Temporary fix: Disable all offloading on eth0
 +
 
 +
  for i in rx tx sg tso gso gro lro; do
 +
          ethtool -K eth0 $i off
 +
  done
 +
 
 +
Source of this problem:
 +
 
 +
https://git.kernel.org/cgit/linux/kernel/git/stable/linux-stable.git/commit/include/linux/skbuff.h?h=v3.3.1&id=9d4dde5215779f4099730194ad30624fdba3d8b2
 +
 
 +
 
 +
We fixed that problem on our machines by ensuring dom0 and domU use same MAX_SKB_FRAGS
 +
 
 +
== Inventory ==
 +
 
 +
As a first step we gathered information about the former infrastructure at *.maemo.org. This "inventory" provided an overview about all components of the infrastructure as well as information that would later on aid during the migration.
 +
 
 +
The following topics were considered important for the migration:
* Legal Issues (Names, Trademarks, Domain Names, etc.)
* Legal Issues (Names, Trademarks, Domain Names, etc.)
* Infrastructure (Web Site, Forum, Wiki, Autobuilder, Mailinglists, Garage, etc.)
* Infrastructure (Web Site, Forum, Wiki, Autobuilder, Mailinglists, Garage, etc.)
-
 
== Legal Issues ==
== Legal Issues ==
Line 354: Line 205:
== Operational Platform ==
== Operational Platform ==
-
For now (2013-01,02) all services are implemented as VM running on Nemein's xen-grid. Plan is to migrate those VM to a virtualization installed on the serevrs (see below) that Nokia donates to the community. This hardware is located at Nemein office right now (2013-02-16) and '''should''' get shipped to iphh.net in Hamburg on Monday 2013-02-18.
+
[2013-03-20] All of maemo.org is running on our supermicro server colocated at IPHH
List of hardware Nokia will donate to HiFo, according to Nemein's plans. [2013-02-08]
List of hardware Nokia will donate to HiFo, according to Nemein's plans. [2013-02-08]
Line 378: Line 229:
|-
|-
| 01
| 01
-
|  
+
| blade-a.maemo.org
|  
|  
|  
|  
Line 386: Line 237:
| http://www.supermicro.nl/products/system/2u/2027/SYS-2027TR-HTRF.cfm?parts=SHOW
| http://www.supermicro.nl/products/system/2u/2027/SYS-2027TR-HTRF.cfm?parts=SHOW
| 2U 19" Rackmount
| 2U 19" Rackmount
-
| 2 * Intel® Xeon® processor E5-2620
+
| Intel® Xeon® processor E5-2620
-
| 2 * 32GB
+
| 32GB
-
| 2 * (raid1 2 * 1TB)
+
| (raid1:2*)1TB, 2*2TB=4TB aux.
|
|
| 3 years
| 3 years
Line 395: Line 246:
|-
|-
| 02
| 02
-
|  
+
| blade-b.maemo.org
|  
|  
|  
|  
Line 403: Line 254:
|  
|  
|  
|  
-
|  
+
| Intel® Xeon® processor E5-2620
-
|
+
| 32GB
-
|  
+
| (raid1:2*)1TB
|
|
|
|
Line 411: Line 262:
|}
|}
 +
http://wstaw.org/m/2013/06/02/plasma-desktopVf3743.png
=== OS and virtulization on community iron (planning, discussion) ===
=== OS and virtulization on community iron (planning, discussion) ===
Please don't forget to tag your contributions with your nick!
Please don't forget to tag your contributions with your nick!
Line 420: Line 272:
-
====Virtulization====
+
====Virtualization====
===== alternative A =====
===== alternative A =====
XEN (with OS blabla of above)
XEN (with OS blabla of above)
Line 621: Line 473:
=== More Detailed Information ===
=== More Detailed Information ===
 +
''this is based on what we got/found on Nemein temporary infra - it's NOT related/applicable to new IPHH infra. For IPHH new infra IPs [[Maemo.org_Infrastructure]]''
In this sub section more detailed information about the entries in the table can be placed. The intent is to keep the table concise while still being able to have all relevant information at hand.
In this sub section more detailed information about the entries in the table can be placed. The intent is to keep the table concise while still being able to have all relevant information at hand.
Line 635: Line 488:
  188.117.59.202      wiki.maemo.org
  188.117.59.202      wiki.maemo.org
  188.117.59.212      bugs.maemo.org
  188.117.59.212      bugs.maemo.org
-
  188.117.59.203      repository.maemo.org
+
  # 188.117.59.203      repository.maemo.org scrubbed
-
  188.117.59.205      stage.maemo.org
+
  188.117.59.205      stage.maemo.org repository.maemo.org (reassigned)
  188.117.59.206      vcs.maemo.org
  188.117.59.206      vcs.maemo.org
Line 714: Line 567:
| maemo || warfare || Falk Stern || falk<at>fourecks.de || (maemo master sysop) || ||
| maemo || warfare || Falk Stern || falk<at>fourecks.de || (maemo master sysop) || ||
|+
|+
-
| maemo || chemist || Ruediger Schiller || chemist<at><to-be-disclosed-by-owner> || Talk || || ||
+
| maemo || chemist || Ruediger Schiller || webmaster<at>talk.m.o || Talk || || ||
|+
|+
| maemo || merlin1991 || Christian Ratzenhofer || <at> || Repos || || [preliminary accepted] ||
| maemo || merlin1991 || Christian Ratzenhofer || <at> || Repos || || [preliminary accepted] ||
Line 723: Line 576:
|+
|+
| || || || || (planet???) || ||
| || || || || (planet???) || ||
-
|}  
+
|}
 +
 
 +
== Unsorted Hints ==
 +
=== ssh access ===
 +
All legacy accounts got ported to new infra.
 +
 
 +
Access to any VM is via plain direct ssh:
 +
ssh <user>@<VM>.maemo.org
 +
 
 +
=== backup ===
 +
we're doing backups to the 4TB auxiliary storage on blade-a, using backupPC:
 +
ssh -L8088:localhost:80 blade-a
 +
konqueror http://localhost:8088
 +
backup-master is Falk
 +
 
 +
talk VM sysop (chem|st) has access to it and control over own backups, via ssh confic on blade-a:
 +
command="sleep 1d",permitopen="127.0.0.1:80"  <ssh-pubkey>
== Steering ==
== Steering ==
 +
council is in charge of any steering.  
council is in charge of any steering.  
 +
Joerg Reisenweber got appointed for "maemo.org infra administration coordinator" and thus is the single point of coordination for any detail questions.
Joerg Reisenweber got appointed for "maemo.org infra administration coordinator" and thus is the single point of coordination for any detail questions.
Line 734: Line 605:
* OBS @ TiZen or SuSe : https://bugs.tizen.org/jira/browse/TINF-48?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
* OBS @ TiZen or SuSe : https://bugs.tizen.org/jira/browse/TINF-48?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 +
 +
* links: http://wiki.maemo.org/Maemo.org_team/Updating_maemo.org_sites
 +
 +
[[Category:maemo.org]]
 +
[[Category:Infrastructure]]

Latest revision as of 14:35, 7 November 2013

please somebody does a major rework of this page, to a new wiki page like "community maemo.org infra - at IPHH" and clean out all the history data that doesn't give any useful info anymore. Also splitting off the autobuilder stuff into a separate page might help --joerg_rw 15:20, 4 June 2013 (UTC)

Started to move stuff

http://wiki.maemo.org/Maemo.org_Infrastructure

http://wiki.maemo.org/Maemo.org_Infrastructure/Autobuilder

Please use Categories below for any future pages of this topic

[[Category:maemo.org]]
[[Category:Infrastructure]]

--sixwheeledbeast 20:46, 5 June 2013 (UTC)

Suggest maintainers should be removed an Maemo.org_team page should be updated

--sixwheeledbeast 20:50, 5 June 2013 (UTC)

Contents

[edit] Introduction

[2013-11-07] migration to our own servers at IPHH accomplished since ~6 months now, below info is for historical documentation.

[up to date as of 2013-02-08]

Albeit Nokia's plans about discontinuation of maemo support been known since spring 2012, Nokia gave "Go" to Nemein (service provider on behalf of Nokia) for the real migration work not earlier than 2 weeks before christmas 2012.

As of January, 18th 2013 the *.maemo.org infrastructure has been consolidated from a 20+ physical servers (aka "irons") to current config and completely migrated to new locations independant of Nokia servers. This task been accomplished by Nemein. Talk.maemo.org forum been integrated with the other infra, many thanks to Nemein for donating the VM for that. Also many thanks to Nemein for this incredible piece of work done during times when others (as well as the guys there) usually are already away for winter holidays.

The current setup (see below) consists of around 10 Virtual Machines hosted by Nemein on their xen-grid. This is an interim solution. Nokia paid Nemein for this consolidation/migration and hosting until end of February.

Handing over control of servers still pending, right now (2013-01-30) it's still Nemein and affiliates to control that infra.

Transfer of control over (*.)maemo.org DNS entries ("the domain") is still being negotiated between Nokia and HiFo, all DNS changes done so far been done by Nokia dnsmaster on Nemein's request

The plans of council and HiFo board so far are: kindly ask Nemein to have *.maemo.org nicely bundled. We hope for this setup to be free of major known bugs (I.E. autobuilder working, repository working albeit maybe slow) when Nemein hands us the package.

[2013-02-08] Negotiations about direct migration to one of our 3 options (see below) are ongoing.

[edit] further plans, state of migration

(obsolete. thus deleted. See wiki history if interested in what happened when)

This page is intended as a central place where status and other operational information can be gathered.

[edit] Plan for migration / Timeline [2013-03-15]

  • Friday, 22.2. (falk)
    • Rack Hardware @ IPHH - Hardware is racked
    • Install base system (CentOS 6.3 with patches from xes)
  • Saturday, 23.2. (xes/falk)
    • Start migrating repository.m.o
    • Start migrating VMs with static data
  • ... (hidden DNS master set up)
    • sync databases, switch DNS entries
    • DNS switched [Nokia] to new IPs on 2013-03-14 1700UTC. Final sync established 1900. since then machines up and running on *new*

VMs we need to migrate:

Name Disk Size Location of act. instance _migrated? _Comments on *new* instance
static 30G nemein synced+up works
wiki 20G nemein synced+up works
repository 900G nemein synced+up We need to check the disk size, this might be too big for current hw, maybe split tablets-dev off.
mail 20G nemein synced+up also has lists
scratchbox 100G iphh setup! will be setup new
vcs 50G nemein synced+up has NFS mounts from garage and repository (copying)
garage 100G nemein synced+up has NFS mounts from stage and vcs (copied, seems to work)
db 100G nemein synced+up works, needs tuning
builder 50G nemein copied+up still needs fixing several aspects
talk  20G nemein  synced+up  up since 2013-03-13, via HTTP-forward
dns  ?? ipph setup! dns records/serial incomplete, bind inactive

[edit] State of final migration

all VMs got migrated to IPHH server, DNS still owned and managed by Nokia [2013-05-29]


[edit] Tips & Tricks for migration

Copying:

Create an image on vmhost

fallocate -l 200g image.img

or, in case fallocate is unavailable

dd if=/dev/zero of=image.img bs=1 count=1 seek=200G

Attach as loop-device

losetup -f image.img
(find the loop-device and create a filesystem on it)

Copy stuff

tar --create -p -j --one-file-system . | pv -br | ssh root@host 'cd /mountpoint  ; tar xpj '
or
cd / ; rsync -arvSxz . root@host:/mount/point

[edit] Stuff to do [2013-03-15]

  • Implement a proper service monitoring for all machines and applications - nagios pending, http://monitor.maemo.org/ganglia/
  • Setup a common policy for root/user accounts and sudo permissions
  • Change root-passwords - done
  • Make SSH root-login key-only - done?
  • Find out, what to sync for final migration - done
  • Configure internal DNS server in /etc/resolv.conf
  • Coordinate DNS setup with Nokia - partially done
  • Consolidate Databases - WIP
  • Add disks to system - done, 4TB on blade-a
  • Setup bugtracking system for infrastructure - done: roundup?
  • fix NFS mounts - WIP
  • update VMs to 3.2.0-38

[edit] Problems we walked into

[edit] Machines throwing their network away

Apparently, XEN has issues if a vm sends too many/too large network packets.

http://lists.xen.org/archives/html/xen-devel/2013-01/msg00198.html has an interesting read about that problem.

Symptom:

 xenbr1: port 8(vif51.0) entered forwarding state
 vif vif-51-0 vif51.0: Too many frags
 vif vif-51-0 vif51.0: fatal error; disabling device
 xenbr1: port 8(vif51.0) entered disabled state

in dmesg

Temporary fix: Disable all offloading on eth0

 for i in rx tx sg tso gso gro lro; do
         ethtool -K eth0 $i off
 done

Source of this problem:

https://git.kernel.org/cgit/linux/kernel/git/stable/linux-stable.git/commit/include/linux/skbuff.h?h=v3.3.1&id=9d4dde5215779f4099730194ad30624fdba3d8b2


We fixed that problem on our machines by ensuring dom0 and domU use same MAX_SKB_FRAGS

[edit] Inventory

As a first step we gathered information about the former infrastructure at *.maemo.org. This "inventory" provided an overview about all components of the infrastructure as well as information that would later on aid during the migration.

The following topics were considered important for the migration:

  • Legal Issues (Names, Trademarks, Domain Names, etc.)
  • Infrastructure (Web Site, Forum, Wiki, Autobuilder, Mailinglists, Garage, etc.)

[edit] Legal Issues

[edit] What is the state about the name "Maemo"?

"... Maemo is currently a registered trademark of Nokia and the domain name is owned by Nokia.

[edit] Who owns "maemo.org"?

Negotiations about domain ownership still ongoing between Hildon Foundation board and Nokia (2013-01-20), if community can't get control over the DNS, we might revert to maemocommunity.org.

Domain ID:D105692361-LROR
Domain Name:MAEMO.ORG
Created On:07-Feb-2005 16:26:32 UTC
Last Updated On:07-Jan-2013 10:25:55 UTC
Expiration Date:07-Feb-2014 16:26:32 UTC
Sponsoring Registrar:MarkMonitor Inc. (R37-LROR)
Registrant ID:mmr-31461
Registrant Name:Nokia Corporation
Registrant Organization:Nokia Corporation
Registrant Street1:P.O.Box 226
Registrant Street2:Nokia Group
Registrant Postal Code:00045
Registrant Country:FI
Registrant Phone:+358.718008000
Registrant FAX:+358.718034496
Registrant Email:dnsauthority@nokia.com


We're planning to ask Nokia to allow a hidden primary [1] for maemo.org, that we will host on a persistent VM (dns) sponsored by Nemein (thanks Eero! :-D ). The purpose is to allow swift changes of IPs under maemo.org without bothering Nokia's DNSmaster, as long as the domain still belongs to Nokia. Once the domain will get transferred to HiFo, this will become less useful but also not exactly any problem. in 6 months or so we can consider tearing down the hidden primary and manage our domain directly.

[edit] What is needed for the community to run maemo.org?

TMO forums donated to Hildon Foundation: http://maemo.org/community/board/tmo_forums_donated_to_hildon_foundation/

[edit] What are the costs?

Nokia paid for hosting until end of February. Current (2013-01-30) interim config (VM on Nemein's xen-grid) will cost 1300EUR/month for the VM, plus 2200EUR/month for the maintenance. For the colocation rackspace, traffic, energy etc of the iron(s) Nokia donates to community there will be another 500+EUR/month. All excl VAT.

At end of February we hope to drop the xen-grid VM since they shall run in a virtualization on our iron by then.

If you're willing to donate, please visit http://hildonfoundation.org/support/

[edit] What about the personal information of the users?

Please refer to the privacy policy posted on the website. If you want info about what's the data stored about you inside *maemo.org, or want this data / your account getting permanently deleted, please contact council@maemo.org

[edit] Operational Platform

[2013-03-20] All of maemo.org is running on our supermicro server colocated at IPHH

List of hardware Nokia will donate to HiFo, according to Nemein's plans. [2013-02-08]

ID Hostname Mgmt IP Address OOB Mgmt IP Address Type (Virtual / Baremetal) System Admin HW Vendor HW Model Form Factor CPU Memory Disk Acquisition Date Warranty Services Comment
01 blade-a.maemo.org Baremetal Falk(warfare) Supermicro http://www.supermicro.nl/products/system/2u/2027/SYS-2027TR-HTRF.cfm?parts=SHOW 2U 19" Rackmount Intel® Xeon® processor E5-2620 32GB (raid1:2*)1TB, 2*2TB=4TB aux. 3 years Falk (for HH CoLo) only 2 of the 4 blades populated
02 blade-b.maemo.org Baremetal Intel® Xeon® processor E5-2620 32GB (raid1:2*)1TB

http://wstaw.org/m/2013/06/02/plasma-desktopVf3743.png

[edit] OS and virtulization on community iron (planning, discussion)

Please don't forget to tag your contributions with your nick!

[edit] Server OS

[edit] alternative A

blabla-OS

[edit] alternative B
[edit] alternative C

[edit] Virtualization

[edit] alternative A

XEN (with OS blabla of above)

[edit] alternative B

VMware

[edit] alternative C

[edit] Services

The following table is intended to give a concise and easily perceivable overview of the *.maemo.org services. Please use the next sub-section for providing more detailed information.

Resource URL (If Applicable) Migration Status (DONE/WIP/NST) Service Maintainer System Admin Software Name Software Version Software License Known Issues Last status update
Maemo Main Web Site http://www.maemo.org 1 BUGS  ? Nemein orphaned links/404s: http://maemo.org/community/council/system_operator_needed/; Login doesn't work 2013-01-25
Maemo Forums http://talk.maemo.org 1 DONE chemist, Reggie Falk, chemist vBulletin Unlimited duration, no upgrades included, acquired on 2012-20-12 Captcha image issues 2013-02-10
Maemo Wiki http://wiki.maemo.org 1 BUGS  ? Nemein (Watch) Email not working; random connection timeouts 2013-01-25
Repositories http://repository.maemo.org 1 BUGS X-Fade, Merlin1981 Nemein former akamai serverfarm, now points to stage.m.o VM master of farm. Hashsum errors legacy 2013-02-20
Blog aggregator http://planet.maemo.org 1 DONE  ? Nemein login flawed? 2013-02-10
Maemo Garage https://garage.maemo.org/ 1 DONE  ?, Woody Nemein 2013-01-25
Maemo Autobuilder 1 NST X-Fade Nemein OFFLINE, x-fade working on it 2013-02-20
Maemo Nameservers 1 WIP Merlin, Falk Nokia Still using Nokia Nameservers; following hidden primary plan til domain transfer to HiFo established 2013-01-25
Drop http://drop.maemo.org 1 WIP X-Fade Nemein 2013-02-10
VCS http://vcs.maemo.org 1 WIP Nemein 2013-02-10
Listserv https://lists.maemo.org 1 BUGS Nemein occasional lockups resp interface down 2013-02-20
Static http://static.maemo.org 1 WIP Nemein temporary fix via NAT port81 redir, instable? 2013-02-20
Stage http://stage.maemo.org obsolete X-Fade Nemein VM got assigned to repository.m.o 2013-02-20
Bugs http://bugs.maemo.org 1 DONE Andre Nemein - 2013-01-25
Scratchbox http://scratchbox.org/ 1 WIP thedead1440 Nemein, thedead1440 80.248.164.245, Logica Finland Oy, migration pending 2013-02-20
Voting Infrastructure  ? 1 WIP woody14619  ?  ? 2013-02-20

[edit] More Detailed Information

this is based on what we got/found on Nemein temporary infra - it's NOT related/applicable to new IPHH infra. For IPHH new infra IPs Maemo.org_Infrastructure In this sub section more detailed information about the entries in the table can be placed. The intent is to keep the table concise while still being able to have all relevant information at hand.

List of VMs and their associated IPs:

IP adresses
188.117.59.198      test.maemo.org
# www.maemo.org      maemo.org
188.117.59.200      www.maemo.org
188.117.59.200      planet.maemo.org
188.117.59.200      static.maemo.org
188.117.59.199      drop.maemo.org
188.117.59.207      garage.maemo.org
188.117.59.204      lists.maemo.org
188.117.59.202      wiki.maemo.org
188.117.59.212      bugs.maemo.org
# 188.117.59.203      repository.maemo.org  scrubbed
188.117.59.205      stage.maemo.org  repository.maemo.org (reassigned)
188.117.59.206      vcs.maemo.org

List of internal IP/VM

127.0.0.1  MaemoTemplate
10.0.0.1   maemo static maintenance
10.0.0.2   wiki bugs
10.0.0.121   stage repository
10.0.0.4   mail smtp lists
10.0.0.5   scratchbox
10.0.0.6   dns
#10.0.0.7   repository
10.0.0.9   vcs drop
10.0.0.10  garage
10.0.0.11  db backup
10.0.0.12  builder
10.0.0.254 fw

Cpu Cores, RAM (in MB), storage (DISK, in GB), of the VMs

Current VMs actually in use (some more were reserved originally since it
was not certain what services could be merged)

Name    C   RAM     DISK
------------------------
MaemoFW 1   1024    10
Builder 1   4096    150
garage  2   8192    100
test    2   2048    30
wikib   2   2048    50
www     2   6144    70
vcs     2   8192    200
db      2   8192    260
mail    2   2048    30
stage   2   2048    870
talk    2   4096    15
========================
        20  48128   1785

sb      2   2048    30
dns     2   2048    30
========================
        25  52224   1845

[edit] Forum (talk.maemo.org)

Unlike the other services, talk.maemo.org is not behind the endian firewall. Maintenence access is not via test jumpserver.

Software: vBulletin
licence: Unlimited duration, no upgrades included, acquired on 2012-20-12

[edit] Scratchbox

Scratchbox is also sponsored by Nokia. (Please verify?) Scratchbox is required for running the Fremantle and Harmattan SDK.

Currently there's a VM on Nemein's xen-grid named "scratchbox", but state of the case is unclear.

[edit] Tracker for Sysops and Maintainers

This tracker is meant for maemo staff and affiliated only

web frontend: roundup.fourecks.de/maemo/
mail access (read docs!): maemo-issue AT fourecks.de

[edit] Service Maintainers (please update/augment/fix)

(please don't usually pester maintainers directly! First try to contact council@maemo.org, we'll forward)

These are the Service Maintainers (in spe), for services like forum (tmo), wiki, bugs, etc. They are (generally) not sysops of the machines their service is running on.

From Nick Full Name E-Mail Services Maintained Status Comments
Nemein mashiara Rambo Eero af Heurlin eero.afheurlin at <to be disclosed by owner> (sysop) [leaving?]
Nemein x-fade Niels Breet Niels<at>maemo.org (mail, IRC, builder, ???...) [leaving?]
Nemein ferenc Ferenc Szekely ferenc<at>maemo.org (mail, sysop, ???...) [leaving?]
maemo warfare Falk Stern falk<at>fourecks.de (maemo master sysop)
maemo chemist Ruediger Schiller webmaster<at>talk.m.o Talk
maemo merlin1991 Christian Ratzenhofer <at> Repos [preliminary accepted]
 ??? andre_ Andre Klapper  ???<at>??? Bugs [???]
 ??? (wiki)
(planet???)

[edit] Unsorted Hints

[edit] ssh access

All legacy accounts got ported to new infra.

Access to any VM is via plain direct ssh:

ssh <user>@<VM>.maemo.org

[edit] backup

we're doing backups to the 4TB auxiliary storage on blade-a, using backupPC:

ssh -L8088:localhost:80 blade-a
konqueror http://localhost:8088

backup-master is Falk

talk VM sysop (chem|st) has access to it and control over own backups, via ssh confic on blade-a:

command="sleep 1d",permitopen="127.0.0.1:80"  <ssh-pubkey>

[edit] Steering

council is in charge of any steering.

Joerg Reisenweber got appointed for "maemo.org infra administration coordinator" and thus is the single point of coordination for any detail questions.

If you got any questions, suggestions, critics, whatever, please contact Joerg (DocScrutinizer) or any other of council members via IRC. or send a mail to council AT maemo.org. We're just community's proxies acting in best intention to do what's probably community's best interest. If you don't agree with what we do or have suggestions how we could do better, please holler. Best place: Friday 1800UTC IRC:(freenode.net)#maemo-meeting

[edit] More