FusionAuth
    • Home
    • Categories
    • Recent
    • Popular
    • Pricing
    • Contact us
    • Docs
    • Login
    1. Home
    2. venkata.dorisala
    V
    • Profile
    • Following 0
    • Followers 0
    • Topics 1
    • Posts 6
    • Best 0
    • Controversial 0
    • Groups 0

    venkata.dorisala

    @venkata.dorisala

    0
    Reputation
    3
    Profile views
    6
    Posts
    0
    Followers
    0
    Following
    Joined Last Online

    venkata.dorisala Unfollow Follow

    Latest posts made by venkata.dorisala

    • RE: FusionAuth with intermittent node restarts

      @dan

      We do have other pods running in our AKS cluster but they don't restart. It's happening only with fusionauth.

      It's possible that database connection might be cut off but again this isn't something we are managing. I think fusionauth is managing the connection pool.

      I spin up fusionauth in my laptop (8 CPU and 32 GB RAM) using docker compose from documentation, i left my laptop ON for whole night and below are my findings.

      1. In Task manager, i see that Vmmem increased from 1125 to 1495MB over night. I'm not running any other containers. so i think fusion and postgres containers are using this memory. Memory isn't being reclaimed but it's increasing steadily even though there is no traffic at all.

      2. As per the logs below, Fusion was up at 8 PM but node got removed at 8 AM due to thread starvation.

      fusionauth_1 | 2021-03-18 8:01:51.334 PM INFO io.fusionauth.api.service.system.NodeService - Node [0f5e6f11-1da1-4382-9fa4-1c259cff22e4] promoted to master at [2021-03-18T20:01:51.334041Z]
      fusionauth_1 | 2021-03-19 8:02:29.504 AM WARN com.zaxxer.hikari.pool.HikariPool - HikariPool-1 - Thread starvation or clock leap detected (housekeeper delta=8h2m2s443ms338?s400ns).
      fusionauth_1 | 2021-03-19 8:02:29.591 AM INFO io.fusionauth.api.service.system.NodeService - Node [0f5e6f11-1da1-4382-9fa4-1c259cff22e4] with address [http://fusionauth:9011] removed because it has not checked in for the last [28944] seconds. Bye node.
      fusionauth_1 | 2021-03-19 8:06:31.046 AM WARN com.zaxxer.hikari.pool.HikariPool - HikariPool-1 - Thread starvation or clock leap detected (housekeeper delta=4m1s541ms835?s).
      fusionauth_1 | 2021-03-19 8:06:31.101 AM INFO io.fusionauth.api.service.system.NodeService - Node [0f5e6f11-1da1-4382-9fa4-1c259cff22e4] with address [http://fusionauth:9011] removed because it has not checked in for the last [241] seconds. Bye node.

      I think it's easy to reproduce on any machine. So far we have reproduced it in Kubernetes, App Service and locally in a laptop.

      posted in Q&A
      V
      venkata.dorisala
    • RE: FusionAuth with intermittent node restarts

      @dan

      What can you tell me about the nodes you are running (in terms of CPU and memory)?
      A: each node 2 Core and 3.5 GB RAM
      Do the restarts happen on any kind of pattern?
      A: Restart happens randomly but after the system has been idle for few hours.
      Are there things that happen to stress the system at the time of the restarts?
      A: No, our system is used very lightly. Hardly used for couple of hours in a day.
      What version of FusionAuth are you on?
      A. 1.23.3, But we have been experiencing this since over a year.
      What troubleshooting steps have you taken? What have you found?
      A. We just tried to check the logs and shared with you whatever is found.
      It looks like some health check is failing at fusion of which we don't have any details. If you can point us to the documentation which explains us about how load balancing works at fusion that might be helpful.

      Looks like only Premium plan offers engineering support. So do we get support for whole month for that 500$ a month or are there any number of support hours limit per month?

      posted in Q&A
      V
      venkata.dorisala
    • FusionAuth with intermittent node restarts

      @dan

      Thanks for fixing the IP address UI issue. Since we are still facing intermittent node restarts we decided to host fusionauth in Azure AppService to see if overrule the AKS cluster. We are still experiencing fusion node restarts every day.

      We managed to capture logs. Can you take a look and let us know what could be causing these restarts. Attached screenshot tells us when exactly the nodes got restarted and attached logs proves the restart.

      935eee1b-5561-47d8-9eb0-ec36f60f959e-image.png

      Level,ResultDescription
      Informational," 2021-03-06 3:07:29.140 PM INFO  io.fusionauth.api.service.system.NodeService - Node [b43e256c-6c5f-4615-b4eb-c1ae5ccf460c] with address [http://172.16.0.2:9011] removed because it has not checked in for the last [83] seconds. Bye node.
      
      "
      Informational," 2021-03-06 3:11:34.150 PM INFO  io.fusionauth.api.service.system.NodeService - Node [b43e256c-6c5f-4615-b4eb-c1ae5ccf460c] promoted to master at [2021-03-06T15:11:34.149960Z], the previous master Node [22aedd8d-4d00-456d-a8f7-22882cb9b5c5] had not checked in since [2021-03-06T15:05:37.306Z]
      
      "
      Error," NOTE: Picked up JDK_JAVA_OPTIONS:  --add-opens=java.base/java.lang=ALL-UNNAMED --add-opens=java.base/java.io=ALL-UNNAMED --add-opens=java.rmi/sun.rmi.transport=ALL-UNNAMED
      
      "
      Error," 06-Mar-2021 15:14:03.441 INFO [main] org.apache.catalina.core.AprLifecycleListener.lifecycleEvent The Apache Tomcat Native library which allows using OpenSSL was not found on the java.library.path: [/usr/java/packages/lib:/usr/lib64:/lib64:/lib:/usr/lib]
      
      "
      Error," 06-Mar-2021 15:14:04.266 INFO [main] org.apache.coyote.AbstractProtocol.init Initializing ProtocolHandler [""http-nio-9011""]
      
      "
      Error," 06-Mar-2021 15:14:04.310 INFO [main] org.apache.tomcat.util.net.NioSelectorPool.getSharedSelector Using a shared selector for servlet write/read
      
      "
      Error," 06-Mar-2021 15:14:04.350 INFO [main] org.apache.coyote.AbstractProtocol.init Initializing ProtocolHandler [""https-jsse-nio-9013""]
      
      "
      Error," 06-Mar-2021 15:14:04.966 INFO [main] org.apache.tomcat.util.net.NioSelectorPool.getSharedSelector Using a shared selector for servlet write/read
      
      "
      Error," 06-Mar-2021 15:14:04.972 INFO [main] org.apache.coyote.AbstractProtocol.init Initializing ProtocolHandler [""ajp-nio-127.0.0.1-9019""]
      
      "
      Error," 06-Mar-2021 15:14:04.979 INFO [main] org.apache.tomcat.util.net.NioSelectorPool.getSharedSelector Using a shared selector for servlet write/read
      
      "
      Error," 06-Mar-2021 15:14:04.983 INFO [main] org.apache.catalina.startup.Catalina.load Initialization processed in 2299 ms
      
      "
      Error," 06-Mar-2021 15:14:05.000 INFO [main] org.apache.catalina.core.StandardService.startInternal Starting service [Catalina]
      
      "
      Error," 06-Mar-2021 15:14:05.002 INFO [main] org.apache.catalina.core.StandardEngine.startInternal Starting Servlet Engine: Apache Tomcat/8.5.57
      
      "
      Error," Mar 06, 2021 3:14:09 PM org.apache.jasper.servlet.TldScanner scanJars
      
      "
      Error," INFO: At least one JAR was scanned for TLDs yet contained no TLDs. Enable debug logging for this logger for a complete list of JARs that were scanned but no TLDs were found in them. Skipping unneeded JARs during scanning can improve startup time and JSP compilation time.
      
      "
      Informational," 2021-03-06 3:14:10.434 PM INFO  io.fusionauth.api.configuration.DefaultFusionAuthConfiguration - Loading FusionAuth configuration file [/usr/local/fusionauth/config/fusionauth.properties]
      
      "
      Informational," 2021-03-06 3:14:10.459 PM INFO  io.fusionauth.api.configuration.DefaultFusionAuthConfiguration - Dynamically set property [fusionauth-app.url] set to [http://172.16.0.2:9011]
      
      "
      Informational," 
      
      "
      Informational,"   - Overriding default value of property [FUSIONAUTH_APP_RUNTIME_MODE] with value [production]
      
      "
      Informational,"   - Overriding default value of property [search.type] with value [database]
      
      "
      Informational,"   - Overriding default value of property [database.mysql.connector-version] with value [8.0.21]
      
      "
      Informational,"   - Overriding default value of property [database.mysql.enforce-utf8mb4] with value [true]
      
      "
      Informational," 2021-03-06 3:14:10.473 PM INFO  com.inversoft.configuration.BasePropertiesFileInversoftConfiguration - 
      
      "
      Informational," 
      
      "
      Informational," 2021-03-06 3:14:10.474 PM WARN  com.inversoft.configuration.BasePropertiesFileInversoftConfiguration - Your FusionAuth configuration file [/usr/local/fusionauth/fusionauth-app/apache-tomcat/../../config/fusionauth.properties] needs attention. Here are the warnings:
      
      "
      Informational,"   - You are using a deprecated configuration property name of [DATABASE_USER]. The new allowed names for that property are [database.username]
      
      "
      Informational," 
      
      "
      Informational," 2021-03-06 3:14:10.488 PM INFO  org.primeframework.mvc.servlet.PrimeServletContextListener - Initializing Prime
      
      "
      Informational," 
      
      "
      Informational," 
      
      "
      Informational," ---------------------------------------------------------------------------------------------------------
      
      "
      Informational," ---------------------------------------------------------------------------------------------------------
      
      "
      Informational," --------------------------------- Starting FusionAuth version [1.23.3] ----------------------------------
      
      "
      Informational," 2021-03-06 3:14:10.490 PM INFO  io.fusionauth.app.guice.FusionAuthModule - 
      
      "
      Informational," 2021-03-06 3:14:11.232 PM INFO  io.fusionauth.api.plugin.guice.PluginModule - No plugins found
      
      "
      Informational," 2021-03-06 3:14:12.272 PM INFO  io.fusionauth.api.service.system.NodeService - Node [8d8c9f0f-2965-44f3-a4bf-4c2aa17e10fa] started.
      
      "
      Informational," 2021-03-06 3:14:14.225 PM INFO  io.fusionauth.api.configuration.DefaultFusionAuthConfiguration - Loading FusionAuth configuration file [/usr/local/fusionauth/config/fusionauth.properties]
      
      "
      Informational," 2021-03-06 3:14:14.229 PM INFO  io.fusionauth.api.configuration.DefaultFusionAuthConfiguration - Dynamically set property [fusionauth-app.url] set to [http://172.16.0.2:9011]
      
      "
      Informational,"   - Overriding default value of property [search.type] with value [database]
      
      "
      Informational," 
      
      "
      Informational," 2021-03-06 3:14:14.232 PM INFO  com.inversoft.configuration.BasePropertiesFileInversoftConfiguration - 
      
      "
      Informational,"   - Overriding default value of property [database.mysql.connector-version] with value [8.0.21]
      
      "
      Informational,"   - Overriding default value of property [database.mysql.enforce-utf8mb4] with value [true]
      
      "
      Informational,"   - Overriding default value of property [FUSIONAUTH_APP_RUNTIME_MODE] with value [production]
      
      "
      Informational," 
      
      "
      Informational,"   - You are using a deprecated configuration property name of [DATABASE_USER]. The new allowed names for that property are [database.username]
      
      "
      Informational," 2021-03-06 3:14:14.232 PM WARN  com.inversoft.configuration.BasePropertiesFileInversoftConfiguration - Your FusionAuth configuration file [/usr/local/fusionauth/fusionauth-app/apache-tomcat/../../config/fusionauth.properties] needs attention. Here are the warnings:
      
      "
      Informational," 
      
      "
      Informational," 2021-03-06 3:14:14.250 PM INFO  com.inversoft.jdbc.hikari.DataSourceProvider - Connecting to PostgreSQL database at [jdbc:postgresql://fusionauth.postgres.database.azure.com/fusionauth]
      
      "
      Informational," 2021-03-06 3:14:14.254 PM WARN  com.zaxxer.hikari.HikariConfig - HikariPool-1 - idleTimeout has been set but has no effect because the pool is operating as a fixed size pool.
      
      "
      Informational," 2021-03-06 3:14:14.258 PM INFO  com.zaxxer.hikari.HikariDataSource - HikariPool-1 - Starting...
      
      "
      Informational," 2021-03-06 3:14:15.822 PM INFO  com.zaxxer.hikari.HikariDataSource - HikariPool-1 - Start completed.
      
      "
      Informational," 2021-03-06 3:14:17.904 PM INFO  com.inversoft.scheduler.DefaultScheduler - Starting up scheduler
      
      "
      Informational," 2021-03-06 3:14:17.905 PM INFO  com.inversoft.scheduler.DefaultScheduler - Scheduler is running
      
      "
      Informational," 2021-03-06 3:14:23.153 PM INFO  io.fusionauth.api.service.system.NodeService - Node [8d8c9f0f-2965-44f3-a4bf-4c2aa17e10fa] added with address [http://172.16.0.2:9011]
      
      "
      Error," 06-Mar-2021 15:14:24.481 INFO [main] org.apache.coyote.AbstractProtocol.start Starting ProtocolHandler [""http-nio-9011""]
      
      "
      Error," 06-Mar-2021 15:14:24.519 INFO [main] org.apache.coyote.AbstractProtocol.start Starting ProtocolHandler [""https-jsse-nio-9013""]
      
      "
      Error," 06-Mar-2021 15:14:24.540 INFO [main] org.apache.coyote.AbstractProtocol.start Starting ProtocolHandler [""ajp-nio-127.0.0.1-9019""]
      
      "
      Error," 06-Mar-2021 15:14:24.564 INFO [main] org.apache.catalina.startup.Catalina.start Server startup in 19580 ms
      
      "
      
      
      posted in Q&A
      V
      venkata.dorisala
    • RE: Multiple nodes sharing IP address?

      @dan

      We do see node start times in the logs. But can't see the reason for the node restart. Everyday fusion nodes(pods) are being restarted.

      Trying to understand the highlighted portion from the logs. Due to some reason NodeService is removing node. The node was just started and within a minute it was removed. So it may not be due to health of the node.

      6d6e3211-deff-4a52-b6b1-d09926d92d93-image.png

      Can't find much information about setting up fusionauth multi-node cluster. Can you share the links for such documentation.

      posted in Q&A
      V
      venkata.dorisala
    • RE: Multiple nodes sharing IP address?

      Hello @dan

      Yes i do see those statements in the logs. Below are the statements for 2 nodes. They do have different Node Id and ip address.

      io.fusionauth.api.service.system.NodeService - Node [c5ae863b-1e86-4858-8516-3dfc93866c04] added with address [http://10.1.0.39:9011]

      io.fusionauth.api.service.system.NodeService - Node [4af1d532-79a7-4a5c-b45a-4e1e7338a1fb] added with address [http://10.1.0.53:9011]

      posted in Q&A
      V
      venkata.dorisala
    • RE: Multiple nodes sharing IP address?

      @dan said in Multiple nodes sharing IP address?:

      FUSIONAUTH_APP_URL

      @dan
      We have configured fusion to run with 2 replicas. So it spins up 2 pods all the times in 2 different nodes.
      We are setting FUSIONAUTH_APP_URL as http://POD_IP:9011 . So it's different for both pods where fusionauth is running. Attached screenshot of both pods env variable.
      169cff25-9e3f-41c0-bab4-8b7b339d4be6-image.png

      We don't have any security policies so within the cluster all nodes can access each other.

      We are still seeing the same IP address for both nodes in fusion UI as shown in screenshot provided by @davidmw

      Please let me know if you need anything else from us to troubleshoot this issue further.

      posted in Q&A
      V
      venkata.dorisala